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DOMAINS OF EXTRACELLULAR REGION OF HUMAN 
5 PLATELET DERIVED GROWTH FACTOR RECEPTOR POLYPEPTIDES 

► 

FIELD OF THE INVENTION 
The present invention relates to receptors for growth 
factors, particularly to human platelet-derived growth factor 

10 receptors (hPDGF-R) . More particularly , it provides various 
composite constructs of human platelet-derived growth factor 
receptors, these constructs retaining ligand binding regions 
found in the natural extracellular region of the receptors. It 
also provides recombinant nucleic acids encoding these 

15 polypeptides, typically also comprising a promoter for 
expression, and fusion peptides on the amino or carboxy 
terminus of the expressed extracellular composite structure • 
Antibodies are provided which recognize epitopes containing 
amino acids contained in different domains of the extracellular 

20 region. Cells comprising these polypeptides and nucleic acids, 
and diagnostic uses of these reagents are also provided. 

BACKGROUND OF THE INVENTION 
Polypeptide growth factors are mitogens that act on 

25 cells by specifically binding to receptors located on the cell 
plasma membrane. The platelet-derived growth factor (PDGF) 
stimulates a diverse group of biochemical responses, e.g., 
changes in ion fluxes, activation of various kinases, 
alteration of cell shape, transcription of various genes, and 

30 modulation of enzymatic activities associated with phospholipid 
metabolism. See, e.g., Bell et al. (1989) "Effects of Platelet 
Factors on Migration of Cultured Bovine Aortic Endothelial and 
Smooth Muscle Cells," Circulation Research 65:1057-1065. 

Platelet-derived growth factors are found in higher 

35 animals, particularly in warm blooded animals, e.g., mammals. 
In vitro, PDGF is a major polypeptide mitogen in serum for 
cells of mesenchymal origin such as fibroblasts, smooth muscle 
cells, and glial cells. In vivo, PDGF does not normally 
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Circulate freely in blood, but is stored in the alpha granules 
of circulating blood platelets . During blood clotting and 
platelet adhesion the granules are released, often at sites of 
injured blood vessels, thereby implicating PDGF in the repair 
of blood vessels. PDGF may stimulate migration of arterial 
smooth muscle cells from the medial to the intimal layer of the 
artery where the muscle cells may proliferate. This is likely 
to be an early response to injury. 

PDGF has also been implicated in wound healing, in 
atherosclerosis, in myeloproliferative disease, and in 
stimulating genes associated with cancerous transformation of 
cells, particularly c-myc and c-fos.. 

The platelet-derived growth factor is composed of two 
homologous polypeptide chains; it is a dimer of 16 kilodalton 
proteins which are disulfide connected. These polypeptides are 
of two types, the type B chain and the type A chain. Three 
forms of the growth factor dimer are found corresponding to a 
homodimer of two type A chains, a homqdimer of two type B 
chains,, and a heterodimer of the type A chain with the type B 
chain. Each of these three different combinations is referred 
to as a PDGF isoform. See, for a review on PDGF, Ross et al. 
(1986) "The Biology of Platelet^ Derived Growth Factor," .Cell. 
46:155-169. The growth factor sequences from mouse and human 
are highly homologous. 

The PDGF acts by binding to the platelet-derived 
growth factor receptor (PDGF-R) . The receptor is typically 
found on cells of mesenchymal origin. The functional receptor 
acts while in a form comprising of two transmembrane 
glycoproteins, each of which is about 180 kilodaltons. Two 
different polypeptides have been isolated, a type B receptor 
polypeptide and a type A receptor polypeptide. 

A sequence of a type B receptor polypeptide of the 
• mouse platelet-derived growth factor receptor polypeptide is 
published in Yarden et al. (1986) Nature 323:226-232. A 
sequence of an type A human platelet-derived growth factor 
receptor (hPDGF-R) polypeptide is disclosed in ttatsui et al. 
(1989) Science 243: 800-803. 
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These PDGF receptors usually have three major 
identifiable regions. The first is a transmembrane region (TM) 
which spans the plasma membrane once, separating the regions of 
the receptor exterior to the cell from the regions interior to 
5 the cell. The second region is an extracellular region (XR) 
which contains the domains that bind the polypeptide growth 
factor (i.e., the ligand binding domains). The third is an 
intracellular region (IR) which possesses a tyrosine kinase 
activity. This tyrosine kinase domain is notable in having an 

10 insert of about 100 amino acids, as compared with most other 
receptor tyrosine kinase domains which are contiguous or have 
shorter insert segments. 

The complete sequences of the human type B and human 
type A receptor polypeptides are reported elsewhere, e.g., 

15 U.S. S.N. 07/309,322, which is hereby incorporated herein by 

reference. However, for many purposes, a smaller or less than 
full length functional protein would be desired. For example, 
smaller molecules may be more easily targeted to areas of 
compromised circulation, or present fewer epitopes or 

20 extraneous domains unrelated to various activities of interest. 
Functional analogues with a slightly modified spectrum of 
activity, or different specificity would be very useful. 

. Thus, the use of new composite constructs exhibiting 
biological activity in common with platelet-derived growth 

25 factor receptor polypeptides will have substantial use as 
research, reagents, diagnostic reagents, and therapeutic 
reagents. In particular, the identification of important 
polypeptide features in the extracellular region of the 
platelet-derived growth factor receptor polypeptides will allow 

30 substitutions and deletions of particular features of the 

domains. Moreover, use of an in vitro assay system provides 
the ability to test cytotoxic or membrane disruptive compounds. 



t 
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SUMMARY OF THE INVENTION 
In accordance with the present invention, defined 
constructs of modified human platelet-derived growth factor 
receptor polypeptides are provided . Extracellular region 
5 domain structures are identified and modifications and 
combinatorial rearrangements of the receptor segments are 
furnished. Both cell bound and soluble forms of modified 
segments are made available, as are methods for assays using 
them, thereby allowing for screening of ligand analogues. 

10 The present invention provides a platelet-derived 

growth factor receptor (hPDGF-R) fragment of between about 8 
and 400 amino acids comprising one or more platelet-derived 
growth factor (PDGF) ligand binding regions (LBR's) from 
extracellular domains Dl, D2, or D3, wherein the fragment binds 

15 a platelet-derived growth factor ligand. Generally, the 
fragment will exhibit a binding affinity of about 5 nM or 
better and will have a sequence of at least about 6 or 8 
contiguous amino acids, preferably at least about 15 or -more 
contiguous amino acids from a domain D3 intra-cysteine region. 

20 The fragment will often lack a transmembrane region. In other 
embodiments, the fragment is soluble, is substantially pure, or 
has at least one ligand binding region derived from a domain 
D3. The fragment may be derived from a type B, or from a type 
A PDGF-R LBR fragment, e.g., from Table 1 or Table 2. In 

25 particular embodiments, the fragment is selected from the group 
of formulae consisting of: 

a) Xa-Dm-Xc; 

b) Xa-Dm-Xl-Dn-Xc; 

c) Xa-Dm-Xl-Dn-X2-Dp-Xc ; and 
30 d) Xa-Dm-Xl-Dn-X2-Dp-X3-Dq-Xc; 

e) Xa-Dm-Xl-Dn-X2-Dp-X3-Dq-X4-Dr-Xc; 
where the fragment is not D1-D2-D3-D4-D5; 

each of Xa, XI, X2, X3, and Xc is, if present, a 
polypeptide segment lacking a D domain? and 
35 each of Dm, Dn, Dp, and Dq is, independently of one 

another, selected from the group consisting of Dl, D2, D3, D4, 
and D5. Preferred fragments are selected from the group 
consisting of: 
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a) D1-D2-D3 or D3-D4-D5; and 

b) . D1-D2-D3-D4 or D2-D3-D4-D5. 

The present invention also embraces a soluble human 
platelet-derived growth factor receptor (hPDGF-R) fragment of 
5 between about 10 and 350 amino acids comprising at least one * 
platelet-derived growth factor (PDGF) ligand binding region 
(LBR) from a domain D3, wherein the fragment specifically binds 
to a platelet-derived growth factor ligand* Usually the 
fragment comprises a sequence of at least about 15 contiguous 

10 amino acids from the intra-cysteine portion of domain D3 and 

has a binding affinity of better than about 5 nM. Other useful 
fragment embodiments will be soluble, substantially pure, or a 
type B or type A PDGF-R LBR, e.g., from Table 1 or Table 2. 

The invention also includes nucleic acid sequences, 

15 including those encoding the above described polypeptide 

fragments. Often the nucleic acid sequences - incorporate a 
promoter, generally operably linked to the sequence encoding 
the fragments. 

Cells comprising the nucleic acids or peptides of the 
20 invention are also embraced. In particular cell embodiments, 
the cell will be a mammalian cell, and often will contain both 
a nucleic acid and a protein expression product of the nucleic 
acid. 

The compositions described above provide antibodies 
25 which recognize an. epitope of a described PDGF-R fragment, but 
not a natural PDGF-R epitope. The antibody will often be a 
monoclonal antibody. 

The present invention also provides a method for 
measuring the PDGF receptor binding activity of a biological 
30 sample comprising the steps of: 

a) contacting an aliquot of a sample to a PDGF ligand in 
the presence of a described PDGF-R fragment in a first 
analysis; 

b) contacting an aliquot of the sample to a PDGF ligand 
35 . in the absence of the PDGF-R fragment in a second analysis? and 

c) comparing the amount of binding in the two analyses. 
In some instances, the PDGF-R fragment is attached to a cell, 
or a solid substrate, e.g., a microtiter dish. 
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The invention also embraces a method for measuring 
the PDGF ligand content of a biological sample comprising the 
steps of: 

a) contacting an aliquot of the sample to a ligand 
5 binding region (LBR) in the presence of a described PDGF-R 

fragment in a first analysis; 

b) contacting an aliquot of the sample to a LBR in the 
absence of the PDGF-R fragment in a second analysis? and 

c) comparing the amount of binding in the two analyses. 
10 In some embodiments, the contacting steps are performed 

simultaneously. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 illustrates a strategy for oligonucleotide 
15 directed in vitro deletion mutagenesis of soluble hPDGF-R 
extracellular domains. Many of these constructs will be 
soluble peptides, or can be modified to be such. 
The abbreviations used are; 

PR = PDGF-R; intact 
20 P - . PDGF-R; extracellular region 

TM = transmembrane 
K = kinase 
S = signal sequence 
Fig. 2 illustrates the structure of a plasmid derived 
25 form pcDL-Sa296 used for expressing various deletion 
polypeptides. 

Fig. 3 illustrates the structure of a plasmid pBJA 
derived from pcDLa296. See Takabe et al. (1988) Mol. Cell. 
Biol. 8:466-472. 
30 1. The pcDL-SRa296 is cut with Xhol. 

2. A polylinker (Xhol-Xbal-Sf il-Notl-EcoRI- 
EcoRV-Hindlll-Clal-Sall) is inserted into 
the Xhol cut vector. 

3. Sail is compatible with the Xhol site; and 
35 generates both a Sail and an Xhol site. 

4. The SV40 16s splice junction is no longer 
present. 
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Fig. 4 illustrates the inhibition of receptor 
phosphorylation by a human type B PDGF receptor polypeptide* 
Labeling with a reagent which binds to phosphorylated tyrosine 
shows that phosphorylation activity is decreased in the 
5 presence of the receptor polypeptide fragment. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

!♦ General Description 
A. PDGF-R 
10 l. structural features 

a. extracellular domain (XR) 

i. signal sequence 

ii. D domains (Ig-like) 

b. transmembrane segment (TM) 
15 c. intracellular domain (IR) 

i. tyrosine kinase 

ii. insert 
2. function 

a. bind ligands (PDGF analogues) 
20 b. tyrosine kinase activity 

c. bind to PDGF-R peptide (dimer 
formation) 

d. phosphorylated segments 
B. Physiological Functions 

25 1. cellular 

2. tissue differentiation 

3 . organismal 

II. Polypeptides 

A. D domains 

30 1. 0-sheet strands 

2. cysteine residues 

B. Soluble Forms, extracellular region 

C. Truncated/Deletion Forms 

D. Fusion Proteins 

35 Genetic Variants (site-directed rautagenized) 

F. Compositions Comprising Proteins 

III. Nucleic Acids 

A. Isolated Nucleic Acids 

B. Recombinant Nucleic Acids 

40 c. Compositions Comprising Nucleic Azids 

IV. Methods for Making PDGF-R Constructs 
A. Protein Purification 

1. affinity with derivatifced PDGF 

2. various ligands, same receptor 
45 B. Expression of Nucleic Acids 

C. Synthetic methods 

V. Antibodies 

VI. Methods for Use 
A. Diagnostic 

50 B. Therapeutic 

* * * 
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I. General Description 

A. Platelet-derived growth factor receptor (PDGF-R) 
The human platelet-derived growth factor receptor 
(hPDGF-R) typically comprises two polypeptides. These 
5 polypeptides , which may be identical or only slightly 

different, associate during the functional activities of ligand 
binding and transducing of the ligand binding signal into the 
cell. 

The platelet-derived growth factor receptor was 

10 identified as having a major component of an approximately 180 
kilodalton protein which, is glycosylated. This glycoprotein 
was identified as a platelet-derived growth factor receptor 
polypeptide. Primary structures of two homologous forms of 
polypeptides have been reported. A type B receptor nucleic 

15 acid and its corresponding polypeptide sequence from mouse are • 
reported in Yarden et al. (1986) Nature 323: 226-232; and a 
homologous genetic sequence has been isolated from humans. See 
U.S. S.N. 07/309,322. A human type A receptor sequence is 
reported in Matsui et al. (1989) Science 243: 800-803. 

20 Although the two different forms of the receptor polypeptides 
are homologous, they are encoded by two separate genes. 

The functional receptor apparently involves a dimer 
of these polypeptides, either homodimers of the type B receptor 
polypeptide or of the type A receptor polypeptide, or a 

25 heterodimer of the type B receptor polypeptide with an type A 
receptor polypeptide. The specificity of binding of each of 
these forms of the receptor is different for each of the 
different forms of platelet-derived growth factor (PDGF) , the 
AA, BB, or AB forms (from either mouse or human, or presumably 

30 other mammals) . 

The PDGF-R is a member of a family of related 
receptors. See, e.g., Yarden et al. supra . Each of these 
receptor polypeptides has a hydrophobic membrane spanning 
region (TM for transmembrane) , a large extracellular region 

35 . (XR) with regularly spaced cystine residues, and a cytoplasmic 
intracellular region (IR) having intracellular tyrosine kinase 
activity. The XR of the PDGF-R has a predicted structure 
containing 5 p -strand-rich immunoglobulin- (Ig)-iike domains. 
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Each of these Ig-like domains consists of about 100 amino 
acids, ranging more specifically from about 88 to about 114 
amino acids, and, except for the fourth domain, contains 
regularly spaced cysteine residues. Many of the structural 
5 features of the various growth factor receptors are homologous, 
including the mouse and human versions of the PDGF-R. Thus, 
many of the structural features defined herein are shared with 
other related proteins. However, in most cases, the functional 
relationship to particular structural features is unknown. 
.10 The intracellular region (IR) is that segment of the 

PDGF-R which is carboxy proximal of the transmembrane (TM) 
segment. The intracellular region is characterized, in part, 
by the presence of a split tyrosine kinase structural domain. 
In the human type B receptor polypeptide, the tyrosine kinase 
15 domain is about 244 amino acids with an insert of about 104 
amino acids. See Table 1. In the human type A receptor 
polypeptide, the domain is about 244 amino acids long with a 
kinase insert of about 103 amino acids. See Table 2. 
Functionally, this domain is defined, in part, by its tyrosine 
20 kinase activity, typically modulated by ligand binding to 

binding sites found in the extracellular region, and appears to 
function in a dimer state. The substrate for phosphorylation 
includes various tyrosine residues on the accompanying receptor 
polypeptide chain, and other proteins which associate with the 
25 receptor. The tyrosine kinase domain is also defined, in part, 
by its homology to similar domains in other tyrosine kinase 
activity containing proteins. See, e.g.,. Yarden et ah (1986) 
Nature 323:226-232. Each IR segment of the dimerized receptor 
complex appears to phosphorylate specific tyrosine residues on 
30 the other polypeptide chain. 

Each transmembrane segment of the human receptor 
polypeptides is about 24 or 25 amino acids long and is 
characterized by hydrophobic amino acid residues. These 
segments have sequences characteristic of membrane spanning 
35 segments. In the human type B receptor polypeptide the 
transmembrane region appears about 25 amino acids long 
extending from about val(500) to trp(524), while in the human 
type A receptor polypeptide, the transmembrane segment appears 
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to be about 24 amino acids extending from about leu (502) to 
trp(526). See, e.g., Claesson-Welsh et al. (1989) Proc. flat'l 
ftead. Sci. USA . 86:4917-4921, 

A polypeptide or nucleic acid is a "human" sequence 

5 if it is derived from, or originated in part from, a natural 
human source. For example, proteins derived from human cells, 
or originally encoded by a human genetic sequence, will be 
human proteins. A sequence is also human if it is selected on 
the basis of its high similarity to a sequence found in a 
10 natural human sample, or is derived therefrom. 

A fusion polypeptide or nucleic acid is a molecule 
which results from the fusion of segments from sequences which 
are not naturally in continuity with one another. Thus, a 
chimeric protein or nucleic acid is a fusion molecule. A 

15 heterologous protein is a protein originating from a different 
source. . . 

B. Physiological Functions 

The PDGF-R appears to have at least four major 
20 different biological functions. The first is the binding of 
ligands, usually the PDGF mitogenic proteins or their 
analogues. These ligands and analogues may also serve as 
either agonists or antagonists . The ligand binding sites, made 
up of ligand binding regions (LBR's), are localized in the 
25 extracellular region (XR) . The functional receptor transduces 
a signal in response to ligand binding, and the resulting 
response is a ligand modulated activity. As the likely ligand 
is a PDGF, or an analogue, the signal will ordinarily be PDGF 
modulated. 

30 A second biological activity relates to the tyrosine 

kinase enzymatic activity. This activity is typically 
activated intracellular ly in response to ligand binding. 
However, since these receptors apparently function in a dimeric 
state, the interchain binding interactions may be considered a 

35 third biological activity which* may be mediated by blocking 
agents. Blocking ox interference with the dimerization 
interactions may be mediated by receptor protein fragments, 
particularly in the functional ligand binding or tyrosine 
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kinase activities. Thus, the introduction of analogues of the 
receptor domains to natural or other receptor polypeptides may 
serve as an additional means to affect PDGF mediation of ligand 
mediated activities, 

5 . The fourth function of the PDGF receptor is as a 

binding substrate for other proteins, e.g., the PI3 kinase, in 
particular, the PDGF receptor is phosphorylated at various 
positions in response to ligand binding or other events. This 
binding interaction activates an enzymatic activity on the part 

10 of the binding protein which activates further cellular or 
metabolic responses. 

The term "ligand" refers to the molecules, usually 
members of the platelet-derived growth factor family, that are 
bound by the ligand binding regions (LBR's). The binding 

15 regions are typically found in the XR. Also, a ligand is a 

molecule that serves either as the natural ligand to which the 
receptor binds, or a functional analogue of a ligand. The 
analogue may serve as an agonist or antagonist. Typically 
ligands will be molecules which share structural features of 

20 natural PDGF, e.g. , polypeptides having similar amino acid 

sequences or other molecules sharing molecular features with a 
ligand. The determination of whether a molecule serves as a 
ligand depends upon the measurement of a parameter or response 
which changes upon binding of that ligand, such as dimerization 

25 or tyrosine kinase activity. See, e.g., Gilman et al. (eds) 
(1990) Goodman and Gilman's: The Pharmacological Bases of 
Therapeutics . 8th Ed., Pergamon Press, which is incorporated 
herein by reference. 

The receptor has ligand binding regions (LBR) , or 

30 regions which are important in determining both affinity and 

specificity of binding of ligand, e.g., PDGF and its analogues. 
The ligand binding regions determine the binding interactions 
between the receptors and ligand. Typically, these regions are 
those contact points between the ligand molecule and the 

35 receptor. These molecular interactions can be determined by 

crystallographic techniques, or by testing which regions of the 
receptor are important in ligand interaction. Various segments 
of the extracellular region of the PDGF receptor make up the 
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ligand binding regions, while other segments form structural 
segments which spatially orient the LBR's in proper arrangement 
to properly bind the ligands 

Generally, the fragment will have a sequence of at 
5 least about 6 contiguous amino acids , usually at least about 8 
contiguous amino acids , more usually at least about 10 
contiguous amino acids, preferably at least "about 13 contiguous 
amino acids, and more preferably at least about 15 or more 
contiguous amino acids. Usually, the LBR'.s will be located 

10 within the intra-cysteine (or equivalent) residues of each Ig- 
like domain/ e.g. , domains Dl, D2 f D3, D4, and D5. They will 
- be preferably derived from D3 sequences, but Dl and D2 derived 
sequences will also be common. Occasionally, sequences from 
D4, D5, or other proteins will provide LBR function. 

15 The extra-cysteine (or equivalent) regions provide 

structural functions, as will inter-domain spacer segments. 
The intra-cysteine portions, or segments, are indicated in 
Tables 4 and 5, and comprise the segments designated c, C , C" , 
D, and E, along with portions of the B and F segments, as 

20 indicated. The extra-cysteine residues comprise the segments 
designated A and G, and portions of B and F. 

The ligand binding regions as defined, in part, by 
the importance of their presence, or their effect on the 
affinity oif PDGF ligand binding. The natural, native full 

25 length PDGF-R binds with a K d of about 0.2 mM. See, e.g., Duan 
et al. (1991) J. Biol. Chem. 266:413-418, which is hereby 
incorporated herein by reference. An I*BR is a segment of 
polypeptide whose presence significantly affects ligand 
binding, generally by at least about a factor of two, usually 

30 by at least about a factor of four, more usually by at least a 
factor of about eight, and preferably by at least about a 
factor of twelve or more. A fragment of this invention which 
binds to "the PDGF ligand will generally bind with a K d of less 
than about 10 pM, more generally less than about l /iM, usually 

35 less than about 0.1 jiM, more usually less than about 10 nM, 

preferably less than about 1 nM, and more preferably less than 
about 0.5 nM. 
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An epitope is an antigenic determinant which 
potentially or actually has elicited an antibody response. It 
may also refer to a structural feature which is defined by an 
antibody binding region, or its equivalent. An epitope need 
5 not necessarily be immunogenic, but will serve as a binding 
site for an antibody molecule or its equivalent. 

II. Polypeptides 

Table 1 discloses the sequence of one allele of a 

10 type B human platelet-derived growth factor receptor 
polypeptide. Both a nucleic acid sequence and its 
corresponding protein sequence are provided. The nucleic acid 
sequence corresponds to Seq. ID No. 1. The amino acid sequence 
corresponds to Seq. ID No. 2. A homologous mouse sequence was 

15 reported in Yarden et al. (1988) Nature 323:226-232. The 
sequence of a mouse PDGF receptor polypeptide also exhibits 
structural features in common with the regions, the domains , 
and the 0 -strand segments of the human receptor polypeptides . 
The mouse polypeptides, and those from other related receptors, 

20 will serve as a source of similar domains, homologous 0-strand 
segments, and inter-segment sequences, and sequences of 
homology for general replacement or substitutions. 
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TABLE 1 

Sequence of one type B human PDGF 
receptor polypeptide allele and protein 

• TGraCTCCTGiGCCTTCAGGAGCCTGCACCMTCCTGCCTGTCCCTCTACTC 52 

AGCTGTTACCCACTCTGGGACCAGCAGTCTTTCTGA^ "| 
TGGAGGGGCTGACTCTCCAGAGCCTGGAACTCTGCCCACACCAGAAGCCATCAG 186 

ATG CGG CTT CCG GGT GCG ATG CCA GCT CTG GCC CTC AAA GGC GAG CTG CTG 237 
Het Arg Leu Pro Gly Ala Met Pro Ala Leu Ala Leu Lys Gly Glu Leu Leu -15 . 

TTG CTG TCT CTC CTG TTA CTT CTG GAA CCA CAG ATC TCT CAG GGC CTG GTC 288 
Leu Leu Ser Leu Leu Leu Leu Leu Glu Pro Gin He Ser Gin Gly Leu Val 2 

GTC ACA CCC CCG GGG CCA GAG CTT GTC CTC AAT GTC TCC AGC ACC TTC GTT 339 
Val Thr Pro Pro Gly Pro Glu Leu Val Leu Asn Val Ser Ser Thr Phe Val 19 

CTG ACC TGC TCG GGT TCA GCT CCG GTG GTG TGG GAA CGG ATG TCC CAG GAG 390 
Leu Thr cys Ser Gly Ser Ala Pro Val Val Trp Glu Arg Met Ser Gin Glu 36 

CCC CCA CAG GAA ATG GCC AAG GCC CAG GAT GGC ACC TTC TCC AGC GTG CTC 441 
Pro Pro Gin Glu Het Ala Lys Ala Gin Asp Gly Thr Phe Ser Ser Val Leu 53 

ACA CTG ACC AAC CTC ACT GGG CTA GAC ACG GGA GAA TAC TTT TGC ACC CAC 492 
Thr Leu Thr Asn Leu Thr Gly Leu Asp Thr Gly Glu Tyr Phe Cys Thr Hxs 70 

AAT GAC TCC CGT GGA CTG GAG ACC GAT GAG CGG AAA CGG CTC TAC ATC TTT 543 

Asn Asp Ser Arg Gly Leu Glu Thr Asp Glu Arg Lys Arg Leu Tyr He Phe 87 

GTG CCA SAT CCC ACC GTG GGC TTC CTC CCT AAT GAT GCC GAG GAA CTA TTC 594 
Val Pro Asp Pro Thr Val Gly Phe Leu Pro Asn Asp Ala Glu Glu Leu Phe 104 

ATC TTT CTC ACG GAA ATA ACT GAG ATC ACC ATT CCA TGC CGA GTA ACA GAC 645 
lie Phe Leu Thr Glu He Thr Glu He Thr He Pro Cys Arg Val Thr Asp 121 

CCA CAG CTG GTG GTG ACA CTG CAC GAG AAG AAA GGG GAC GTT GCA CTG CCT 696 
Pro Gin Leu Val Val Thr Leu His Glu Lys Lys Gly Asp Val Ala Leu Pro 138 

GTC CCC TAT GAT CAC CAA CGT GGC TTT TCT GGT ATC TTT GAG GAC AGA AGC 747 
Val Pro Tyr Asp His Gin Arg Gly Phe Ser Gly He Phe Glu Asp Arg Ser 155 

TAC ATC TGC AAA ACC ACC ATT GGG GAC AGG GAG GTG GAT TCT GAT GCC TAC 798 
Tyr He Cys Lys Thr Thr lie Gly Asp Arg Glu Val Asp Ser Asp Ala Tyr 172 

TAT GTC TAC AGA CTC CAG GTG TCA TCC ATC AAC GTC TCT GTG AAC GCA GTG 849 
Tyr Val Tyr Arg Leu Gin Val Ser Ser He Asn Val Ser Val Asn Ala Val 189 

CAG ACT GTG GTC CGC CAG GGT GAG AAC ATC ACC CTC ATG TGC ATT GTG ATC 900 
Gin Thr Val Val Arg Gin Gly Glu Asn He Thr Leu Met Cys He Val He 206 

GGG AAT GAT GTG GTC AAC TTC GAG TGG ACA TAC CCC CGC AAA GAA AGT GGG 951 
Gly Asn Asp Val Val Asn Phe Glu Trp Thr Tyr Pro Arg Lys Glu Ser Gly 223 
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CGG CTG GTG GAG CCG GTG ACT GAC TTC CTC TTG GAT ATG CCT TAC CAC ATC 1002 
Arg Leu Val Glu Pro Val Thr Asp Phe Leu Leu Asp Met Pro Tyr His He 240 

CGC TCC ATC CTG CAC ATC CCC AGT GCC GAG TTA GAA GAC TCG GGG ACC TAC 1053 
Arg Ser He Leu His He Pro Ser Ala Glu Leu Glu Asp Ser Gly Thr Tyr 257 

ACC TGC AAT GTG ACG GAG AGT GTG AAT GAC CAT CAG GAT GAA AAG GCC ATC 1104 
Thr Cys Asn Val Thr Glu Ser Val Asn Asp His Gin Asp Glu Lys Ala He 274 

AAC ATC ACC GTG GTT GAG AGC GGC TAC GTG CGG CTC CTG GGA GAG GTG GGC 1155 
Asn He Thr Val Val Glu Ser Gly Tyr Val Arg Leu Leu Gly Glu val Gly 291 

ACA CTA CAA TTT GCT GAG CTG CAT CGG AGC CGG ACA CTG CAG GTA GTG TTC 1206 
Thr Leu Gin Phe Ala Glu Leu His Arg Ser Arg Thr Leu Gin Val Val Phe 3 08 

GAG GCC TAC CCA CCG CCC ACT GTC CTG TGG TTC AAA GAC AAC CGC ACC CTG 1257 
Glu Ala Tyr Pro Pro Pro Thr Val Leu Trp Phe Lys Asp Asn Arg Thr Leu .325 

GGC GAC TCC AGC GCT GGC GAA ATC GCC CTG TCC ACG CGC AAC GTG TCG GAG 13 08 
Gly Asp Ser Ser Ala Gly Glu He Ala Leu Ser Thr Arg Asn Val Ser Glu 3 42 

ACC CGG TAT GTG TCA GAG CTG ACA CTG GTT CGC GTG AAG GTG GCA GAG GCT 1359 
Thr Arg Tyr Val Ser Glu Leu Thr Leu Val Arg Val Lys Val Ala Glu Ala 359 

GGC CAC TAC ACC ATG CGG GCC TTC CAT GAG GAT GCT GAG GTC CAG CTC TCC 1410 
Gly His Tyr Thr Met Arg Ala Phe His Glu Asp Ala Glu Val Gin Leu Ser 376 

TTC CAG CTA CAG ATC AAT GTC CCT GTC CGA GTG CTG GAG CTA AGT GAG AGC 1461 
Phe Gin Leu Gin He Asn Val Pro Val Arg Val Leu Glu Leu Ser Glu Ser 393 

CAC CCT GAC AGT GGG GAA CAG ACA GTC CGC TGT CGT GGC CGG GGC ATG CCG 1512 
His Pro Asp Ser Gly Glu Gin Thr Val Arg Cys Arg Gly Arg Gly Met Pro 410 

CAG CCG AAC ATC ATC TGG TCT GCC TGC AG A GAC CTC AAA AGG TGT CCA CGT 1563 
Gin Pro Asn He He Trp Ser Ala Cys Arg Asp Leu Lys Arg Cys Pro Arg 427 

GAG CTG CCG CCC ACG CTG CTG GGG AAC AGT TCC GAA GAG GAG AGC CAG CTG 1614 

Glu Leu Pro Pro Thr Leu Leu Gly Asn Ser ser Glu Glu Glu Ser Gin Leu 444 

GAG ACT AAC GTG ACG TAC TGG GAG GAG GAG CAG GAG TTT GAG GTG GTG AGC 1665 
Glu Thr Asn Val Thr Tyr Trp Glu Glu Glu Gin Glu Phe Glu Val Val Ser 461 

ACA CTG CGT CTG CAG CAC GTG GAT CGG CCA CTG TCG GTG CGC TGC ACG CTG 1716 

Thr Leu Arg Leu Gin His Val Asp Arg Pro Leu Ser Val Arg Cys Thr Leu 478 

CGC AAC GCT GTG GGC CAG GAC ACG CAG GAG GTC ATC GTG GTG CCA CAC TCC 1767 

Arg Asn Ala Val Gly Gin Asp Thr Gin Glu Val He Val Val Pro His Ser 495 

TTG CCC TTT AAG GTG GTG GTG ATC TCA GCC ATC CTG GCC CTG GTG GTG CTC 1818 
Leu Pro Phe Lys Val Val Val He Ser Ala He Leu Ala Leu Val Val Leu 512 
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ACC ATC ATC TCC CTT ATC ATC CTC ATC ATG CTT TGG GAG AAG AAG CCA CGT 1869 
Thr lie lie Ser Leu lie lie Leu lie Met Leu Trp Gin Lys Lys Pro Arg 529 

TAC GAG ATC CGA TGG AAG GTG ATT GAG TCT GTG AGC TCT GAC GGC CAT GAG 1920 
Tyr Glu He Arg Trp Lys Val He Glu Ser Val Ser Ser Asp Gly His Glu 546 

TAC ATC TAC GTG GAC CCC ATG CAG CTG CCC TAT GAC TCC ACG TGG GAG CTG 1971 
Tyr lie Tyr Val Asp Pro Met Gin Leu Pro Tyr Asp Ser Thr Trp Glu Leu 563 

CCG CGG GAC CAG CTT GTG CTG GGA CGC ACC CTC GGC TCT GGG GCC TTT GGG 2022 
Pro Arg Asp Gin Leu Val Leu Gly Arg Thr Leu Gly Ser Gly Ala Phe Gly 580 

CAG GTG GTG GAG GCC ACA GCT CAT GGT CTG AGC CAT TCT CAG GCC ACG ATG 2073 
Gin Val Val Glu Ala Thr Ala His Gly Leu Ser His Ser Gin Ala Thr Met 597 

AAA GTG GCC GTC AAG ATG CTT AAA TCC ACA GCC CGC . AGC AGT GAG AAG CAA 2124 
Lys Val Ala Val Lys Met Leu Lys Ser Thr Ala Arg Ser Ser Glu Lys Gin 614 

GCC CTT ATG TCG GAG CTG AAG ATC ATG AGT CAC CTT GGG CCC CAC CTG AAC 2175 
Ala Leu Met Ser Glu Leu Lys He Met Ser His Leu Gly Pro His Leu Asn 631 

GTG GTC AAC CTG TTG GGG GCC TGC ACC AAA GGA GGA CCC ATC TAT ATC ATC 2226 
Val Val Asn Leu Leu Gly Ala Cys Thr Lys Gly Gly Pro He Tyr He He 648 

ACT GAG TAC TGC CGC TAC GGA GAC CTG GTG GAC TAC CTG CAC CGC AAC AAA 2277 
Thr Glu Tyr Cys Arg Tyr Gly Asp Leu Val Asp Tyr Leu His Arg Asn Lys 665 

CAC ACC ZTTC CTG CAG CAC CAC TCC GAC AAG CGC CGC CCG CCC AGC GCG GAG 2328 
His Thr Phe Leu Gin His His Ser Asp Lys Arg Arg Pro Pro Ser Ala Glu 682 

CTC TAC AGC AAT GCT CTG CCC GTT GGG CTC CCC CTG CCC AGC CAT GTG TCC 2379 
Leu Tyr Ser Asn Ala Leu Pro Val Gly Leu Pro Leu Pro Ser His Val Ser 699 

TTG ACC GGG GAG AGC GAC GGT GGC TAC ATG GAC ATG AGC AAG GAC GAG TCG 2430 
Leu Thr Gly Glu Ser Asp Gly Gly Tyr Met Asp Met Ser Lys Asp Glu Ser 716 

GTG GAC TAT GTG CCC ATG CTG GAC ATG AAA GGA GAC GTC AAA TAT GCA GAC 2481 
Val Asp Tyr Val Pro Met Leu Asp Met Lys Gly Asp Val Lys Tyr Ala Asp 733 

ATC GAG TCC TCC AAC TAC ATG GCC CCT TAC GAT AAC TAC GTT CCC TCT GCC 2532 
He Glu Ser Ser Asn Tyr Met Ala Pro Tyr Asp Asn Tyr Val Pro Ser Ala 750 

CCT GAG AGG ACC TGC CGA GCA ACT TTG ATC AAC GAG TCT CCA GTG CTA AGC 2583 
Pro Glu Arg Thr cys Arg Ala Thr Leu He Asn Glu Ser Pro Val Leu Ser 767 

TAC ATG GAC CTC GTG GGC TTC AGC TAC CAG GTG GCC AAT GGC ATG GAG TTT 2634 

Tyr Met Asp Leu Val Gly Phe Ser Tyr Gin Val Ala Asn Gly Met Glu Phe 784 

CTG GCC TCC AAG AAC TGC GTC CAC AGA GAC CTG GCG GCT AGG AAC GTG CTC 2685 
Leu Ala Ser Lys Asn Cys Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 801 
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ATC TGT GAA GGC AAG CTG GTC AAG ATC TGT GAC TTT GGC CTG GCT CGA GAC 2736 
lie cys Glu Gly Lys Leu Val Lys lie Cys Asp Phe Gly Leu Ala Arg Asp 8 IB 

ATC ATG CGG GAC TCG AAT TAC ATC TCC AAA GGC AGC ACC TTT TTG CCT TTA 2787 
lie Met "Arg Asp Ser Asn Tyr lie Ser Lys Gly Ser Thr Phe Leu Pro Leu 835 

AAG TGG ATG GCT CCG GAG AGC ATC TTC AAC AGC CTC TAC ACC ACC CTG AGC 2838 
Lys Trp Met Ala Pro Glu Ser lie Phe Asn Ser Leu Tyr Thr Thr Leu ser 852 

GAC GTG TGG TCC TTC GGG ATC CTG CTC TGG GAG ATC TTC ACC TTG GGT GGC 2880 
Asp Val Trp Ser Phe Gly He Leu Leu Trp Glu He Phe Thr Leu Gly Gly 869 

ACC CCT TAC CCA GAG CTG CCC ATG AAC GAG CAG TTC TAC AAT GCC ATC AAA 2940 

Thr Pro Tyr Pro Glu Leu Pro Met Asn Glu Gin Phe Tyr Asn Ala lie Lys 886 

CGG GGT TAC CGC ATG GCC CAG CCT GCC CAT GCC TCC GAC GAG ATC TAT GAG 2991 
Arg Gly Tyr Arg Met Ala Gin Pro Ala His Ala Ser Asp Glu He Tyr Glu 903 

ATC ATG CAG AAG TGC TGG GAA GAG AAG TTT GAG ATT CGG CCC CCC TTC TCC 3042 
He Met Gin Lys Cys Trp Glu Glu Lys Phe Glu He Arg Pro Pro Phe Ser 920 

CAG CTG GTG CTG CTT CTC GAG AGA CTG TTG GGC GAA GGT TAC AAA AAG AAG 3093 
Gin Leu Val Leu Leu Leu Glu Arg Leu Leu Gly Glu Gly Tyr Lys Lys Lys 937 

TAC CAG CAG GTG GAT GAG GAG TTT CTG AGG AGT GAC CAC CCA GCC ATC CTT 3144 
Tyr Gin Gin Val Asp Glu Glu Phe Leu Arg Ser Asp His Pro Ala He Leu 954 

CGG TCCJCAG GCC CGC TTG CCT . GGG TTC CAT GGC CTC CGA TCT CCC CTG GAC 3195 
Arg Ser Gin Ala Arg Leu Pro Gly Phe His Gly Leu Arg Ser Pro Leu Asp 971 

ACC AGC TCC GTC CTC TAT ACT GCC GTG CAG CCC AAT GAG GGT GAC AAC GAC 3246 
Thr Ser Ser Val Leu Tyr Thr Ala Val Gin Pro Asn Glu Gly Asp Asn Asp 989 

TAT ATC ATC CCC CTG CCT GAC CCC AAA CCT GAG GTT GCT GAC GAG GGC CCA 3297 
Tyr He He Pro Leu Pro Asp Pro Lys Pro Glu Val Ala Asp Glu Gly Pro 1005 

CTG GAG GGT TCC CCC AGC CTA GCC AGC TCC ACC CTG AAT GAA GTC AAC ACC 3 348 
Leu Glu Gly Ser Pro Ser Leu Ala Ser Ser Thr Leu Asn Glu Val Asn Thr 1022 

TCC TCA ACC ATC TCC TGT GAC AGC CCC CTG GAG CCC CAG GAC GAA CCA GAG 3399 
Ser Ser Thr He Ser Cys Asp Ser Pro Leu Glu. Pro Gin Asp Glu Pro Glu 1039 

CCA GAG CCC CAG CTT GAG CTC CAG GTG GAG CCG GAG CCG GAG CTG GAA CAG 3450 
Pro Glu Pro Gin Leu Glu Leu Gin Val Glu Pro Glu Pro Glu Leu Glu Gin 1056 

TTG CCG GAT TCG GGG TGC CCT GCG CCT CGG GCG GAA GCA GAG GAT AGC TTC 3501 
Leu Pro Asp Ser Gly Cys Pro Ala Pro Arg Ala Glu Ala Glu Asp Ser Phe 1073 



CTG TAGGGGGCTGGCCCCTACCCTGCCCTGCCTGAAGCTCCCCCGCTGCCAGCACCCAGCATCTCC 3567 
Leu 1074 
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TGGCCTGGCCTGGCCGGGCTTCCTGTCAGCCAGGCTGCCCTTATCM * "4 

TCTGCTCCTGACGTCn^GCCCCAAA^^ "01 

CTGACCAGCCCTCTGCCTCCAGGGAGGCCAACTGACTCrGAGCCAGGGTTC 3768 

TTTCCCATATGTAAGATGGGAAAGTTAGGCTTGATGACCCAGAAT 3835 

CAGGTGGGGAGACCGAATCCCTCCCTCGGAAGATTCTTGGAG ^902 

TTCTGTTCAGCCAGCTACCCCTCAAGGAATCATAGCTCTCTCCTCGCACTTOTATCCA 3969 

TAGGGAAGAGACCCTAGCCTCCCTGGCTGCTGGCTGAGCTAGGGCCTM 4036 
~+ « ~+ » * r*r*r** rmrwrmnnnrx to xn«nr»CTrcCTCCGTTCCCTGGCCCGAGCTGGTCT 4103 



4170 
4237 



GGGGCCAXTAGGCAGCCTAAJTTAATGCTGGAGGCTC 

CCTTGCCCAGGGCACTTGGAGCAC^CGCAGCCATAGCAAGTGCCTGTGTC — ■> . 

TCAGTCCrTGGGGCriTTTTOT?TATCAC 4304 
CGGGCCCCGC^TCTGTGATGAGAATGTAAATGTGCCAGTGTGGAGTGGCCACGTGTGTGTGCCAGAT 4371 

ATGGCCCTGGCTCTGCATTGGACCrrcCTATGA^ 4438 
AGTTTCCCCTTCAAAAAATGAATAAGTCX^ACTTATTAACTCTGAGTGCCTTC 4505 
CTAGAGTATCCAGGTGGTTGCACATTTGTCCAGATGAAGCAAM 4572 
TGGGGGTCAGCTGGGCTCCTGGGAGATTCCAGATCACACATCACACTCTGGGGACTCAGGAACCATG 



4639 



TGGGGGTCAGCTGGGCTCCTGGGAGATTCCAGATCACACATCACACTCTGGGGA 

CCCXTTTCCCCAGGCarCCAGCA^ 4706 
CGGTGTCCTGGAAAGCCCCCAGCAGCTGCCCCAGGGACATGGGAAGACCACGGGACCTCTTT^ 4773 
CCCACGATGACCTCCGGGGGTATCCTGGGCAAAAGGGACAAAGAGGGCAAATGAGATCACCTCCTGC 4 840 
AGCCCACCACTCCAGCACCTGTGCCGAGGTCTGCGTCGAAGA^^ 4 907 

GTCTTGTAAAAGACAAGAAG CTTCAG ATGGGTACCCCAAGAAGGATGTGAG AGGTGGG ^£^^ GG ^ 4974 
GGTTTCCCCCTCACCCACCAGCTGCCCCATCCCTGAGGCAGCGCTCCAT^ 5041 



GAGGGTTGTAGCCAAGACGCCCCCGCACGGGGAGGGTTGGGAAGGGGGTGCAGGAAGeTUAA = 

CTGGGCACCAACCCTGCAXTGCAGGTTGGCACCTTACTTCCCTGGGATCCCA 5242 

GGGAGAGTGGGTTCTCAATACGGTACCAAAGATATAATCAOT 5309 

CACGTTAAGTCACATTTATACAGCAGAAATGCTATTTTGTATGCTGTO 5376 

CTTTTTTTTAAGGGAAAGATTTTAATATTAAACCTGGTGCTTCTCACTCAC 5427 
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Table 2 discloses the sequence of an allele of an 
type A human platelet-derived growth factor receptor 
polypeptide. Both a nucleic acid sequence and its 
corresponding protein sequence are provided. The nucleic acid 
sequence corresponds to Seq. ID No. 5. The amino acid sequence 
corresponds to Seq. ID No. 4. Another human type A allele 
sequence is reported in Matsui et al. (1989) Science 243:800- 
803. 
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Sequence of a human type A 
PDGF receptor polypeptide allele and protein 



TTGGAGCTACAGGGAGAGAAACAGAGGAGGAGACTGCAAGAGATCATTGGAGGCCGTGGGC2 
ACGCTCITTACTCCATGTGTGGGACATTCATTGCGGAATAACATCGGAGGAGAAGTTTCCCAGAGCT 



61 
128 



ATG GGG ACT TCC CAT CCG GCG TTC CTG GTC TTA GGC TGT CTT CTC ACA GGG 179 

Met Gly Thr Ser His Pro Ala Phe Leu Val Leu Gly Cys Leu Leu Thr Gly -7 

CTG AGC CTA ATC CTC TGC CAG CTT TCA TTA CCC TCT ATC CTT CCA AAT GAA 230 

Leu Ser Leu lie Leu Cys Gin Leu Ser Leu Pro Ser lie Leu Pro Asn Glu 11 

AAT GAA AAG GTT GTG CAG CTG AAT TCA TCC TTT TCT CTG AGA TGC TTT GGG 281 

Asn Glu Lys Val Val Gin Leu Asn Ser Ser Phe Ser Leu Arg Cys Phe Gly 2B . 

GAG AGT GAA GTG AGC TGG CAG TAC CCC ATG TCT GAA GAA GAG AGC TCC GAT 332 

Glu Ser Glu Val Ser Trp Gin Tyr Pro Met Ser Glu Glu Glu Ser Ser Asp 45 

GTG GAA ATC AGA AAT GAA GAA AAC AAC AGC GGC CTT TTT GTG ACG GTC TTG 383 

Val Glu lie Arg Asn Glu Glu Asn Asn Ser Gly Leu Phe Val Thr Val Leu 62 

6*A GTG AGC AGT GCC TCG GCG GCC CAC ACA GGG TTG TAC ACT TGC TAT TAC 434 

Glu Val Ser Ser Ala Ser Ala Ala His Thr Gly Leu Tyr Thr Cys Tyr Tyr 79 

AAC CAC ACT CAG ACA GAA GAG AAT GAG CTT GAA GGC AGG CAC ATT TAC ATC 485 

Asn His Thr Gin Thr Glu Glu Asn Glu Leu Glu Gly Arg His lie Tyr- lie 95 

TAT GTG CCA GAC CCA GAT GTA GCC TTT GTA CCT CTA GGA ATG ACG GAT TAT 536 

Tyr Val Pro Asp Pro Asp Val Ala Phe Val Pro Leu Gly Met Thr Asp Tyr 113 

TTA GTC -ATC GTG GAG GAT GAT GAT TCT GCC ATT ATA CCT TGT CGC ACA ACT 587 

Leu Val He Val Glu Asp Asp Asp Ser Ala He He Pro Cys Arg Thr Thr 130 

GAT CCC GAG ACT CCT GTA ACC TTA CAC AAC AGT GAG GGG GTG GTA CCT GCC 638 

Asp Pro Glu Thr Pro Val Thr Leu His Asn Ser Glu Gly Val Val Pro Ala 147 

TCC TAC GAC AGC AGA CAG GGC TTT AAT GGG ACC TTC ACT GTA GGG CCC TAT 689 

Ser Tyr Asp Ser Arg Gin Gly Phe Asn Gly Thr Phe Thr Val Gly Pro Tyr 164 

ATC TGT GAG GCC ACC GTC AAA GGA AAG AAG TTC CAG ACC ATC CCA TTT AAT 740 

He Cys Glu Ala Thr Val Lys Gly Lys Lys Phe Gin Thr He Pro Phe Asn 181 

GTT TAT GCT TTA AAA GCA ACA TCA GAG CTG GAT CTA GAA ATG GAA GCT CTT 791 

Val Tyr Ala Leu Lys Ala Thr Ser Glu Leu Asp Leu Glu Met Glu Ala Leu 198 

AAA ACC GTG TAT AAG TCA GGG GAA ACG ATT GTG GTC ACC TGT GCT GTT TTT 842 

Lys Thr Val Tyr Lys Ser Gly Glu Thr He Val Val Thr cys Ala Val Phe 215 

AAC AAT GAG GTG GTT GAC CTT CAA TGG ACT TAC CCT GGA GAA GTG AAA GGC 893 

Asn Asn Glu. Val Val Asp Leu Gin Trp Thr Tyr Pro Gly Glu Val Lys Gly 232 
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AAA GGC ATC ACA ATG CTG GAA GAA ATC AAA GTC CCA TCC ATC AAA TTG GTG .944 
Lys Gly lie Thr Met Leu Glu Glu lie Lys Val Pro Ser lie Lys Leu Val 249 

TAC ACT TTG ACG GTC CCC GAG GCC ACG GTG AAA GAC AGT GGA GAT TAC GAA 995 
Tyr Thr Leu Thr val Pro Glu Ala Thr Val Lys Asp Ser Gly Asp Tyr Glu 266 

TGT GCT GCC CGC CAG GCT ACC AGG GAG GTC AAA GAA ATG AAG AAA GTC ACT 104 6 
Cys Ala Ala Arg Gin Ala Thr Arg Glu Val Lys Glu Met Lys Lys Val Thr 283 

ATT TCT GTC CAT GAG AAA GGT TTC ATT GAA ATC AAA CCC ACC TTC AGC CAG 1097 
lie Ser Val His Glu Lys Gly Phe lie Glu He Lys Pro Thr Phe Ser Gin 300 

TTG GAA GCT GTC AAC CTG CAT GAA GTC AAA CAT TTT GTT GTA GAG GTG CGG 1148 
Leu Glu Ala Val Asn Leu His Glu Val Lys His Phe Val Val Glu Val Arg 317 

GCC TAC CCA CCT CCC AGG ATA TCC TGG CTG AAA AAC AAT CTG ACT CTG ATT 1199 
Ala Tyr Pro Pro Pro Arg He Ser Trp Leu Lys Asn Asn Leu Thr Leu He 334 

GAA AAT CTC ACT GAG ATC ACC ACT GAT GTG GAA AAG ATT CAG GAA ATA AGG 1250 
Glu Ash Leu Thr Glu lie Thr Thr Asp Val Glu Lys He Gin Glu He Arg 351 

TAT CGA AGC AAA TTA AAG CTG ATC CGT GCT AAG GAA GAA GAC AGT GGC CAT 1301 
Tyr Arg ser Lys Leu Lys Leu He Arg Ala Lys Glu Glu Asp Ser Gly His 368 

TAT ACT ATT GTA GCT CAA AAT GAA GAT GCT GTG AAG AGC TAT ACT TTT GAA 1352 
Tyr Thr He Val Ala Gin Asn Glu Asp Ala Val Lys Ser Tyr Thr Phe Glu 385 

CTG TTA ACT CAA GTT CCT TCA TCC ATT CTG GAC TTG GTC GAT GAT CAC CAT 1403 
Leu Leu Thr Gin Val Pro Ser Ser He Leu Asp Leu Val Asp Asp His His 402 

GGC TCA ACT GGG GGA CAG ACG GTG AGG TGC ACA GCT GAA GGC ACG CCG CTT 1454 
Gly ser Thr Gly Gly Gin Thr Val Arg Cys Thr Ala Glu Gly Thr Pro Leu 419 

CCT GAT ATT GAG TGG ATG ATA TGC AAA GAT ATT AAG AAA TGT AAT AAT GAA 1505 
Pro Asp He Glu Trp Met He Cys Lys Asp He Lys Lys Cys Asn Asn Glu 436 

ACT TCC TGG ACT ATT TTG GCC AAC AAT GTC TCA AAC ATC ATC ACG GAG ATC 1556 
Thr Ser Trp Thr lie Leu Ala Asn Asn Val Ser Asn He He Thr Glu He 453 

CAC TCC CGA GAC AGG AGT ACC GTG GAG GGC CGT GTG ACT TTC GCC AAA GTG 1607 
His Ser Arg Asp Arg Ser Thr Val Glu Gly Arg Val Thr Phe Ala Lys val 470 

GAG GAG ACC ATC GCC GTG CGA TGC CTG GCT AAG AAT CTC CTT GGA GCT GAG 1658 
Glu Glu Thr He Ala Val Arg Cys Leu Ala Lys Asn Leu Leu Gly Ala Glu 4 87 

AAC CGA GAG CTG AAG CTG GTG GCT CCC ACC CTG CGT TCT GAA CTC ACG GTG 1709 
Asn Arg Glu Leu Lys Leu Val Ala Pro Thr Leu Arg Ser Glu Leu Thr Val 504 

GCT GCT GCA GTC CTG GTG CTG TTG GTG ATT GTG ATC ATC TCA CTT ATT GTC 1760 
Ala Ala Ala Val Leu Val Leu Leu Val He Val lie He Ser Leu He Val 521 



* 



• 
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CTG GTT GTC ATT TGG AAA CAG AAA CCG AGG TAT GAA ATT CGC TGG AGG GTC 1811 
Leu Val yal He Trp Lys Gin Lys Pro Arg Tyr Glu He Arg Trp Arg Val 538 

ATT GAATCA ATC AGC CCA GAT GGA CAT GAA TAT ATT TAT GTG GAC CCG ATG 1862 
He Glu ser He Ser Pro Asp Gly His Glu Tyr He Tyr Val Asp Pro Met 555 

GAG CTG CCT TAT GAC TCA AGA TGG GAG TTT CCA AGA GAT GGA CTA GTG CTT 1913 
Gin Leu Pro Tyr Asp Ser Arg Trp Glu Phe Pro Arg Asp Gly Leu Val Leu 572 

GGT CGG GTC TTG GGG TCT GGA GCG TTT GGG AAG GTG GTT GAA GGA ACA GCC 1964 
Gly Arg Val Leu Gly Ser Gly Ala Phe Gly Lys Val Val Glu Gly Thr Ala 589 

TAT GGA TTA AGC CGG TCC CAA CCT GTC ATG AAA GTT GCA GTG AAG ATG CTA 2015 
Tyr Gly Leu Ser Arg Ser Gin Pro Val Met Lys Val Ala Val Lys Met Leu 606 

AAA CCC ACG GCC AGA TCC ACT GAA AAA CAA GCT CTC ATG TCT GAA CTG AAG 2066 
Lys Pro Thr Ala Arg ser Ser Glu Lys Gin Ala Leu Met ser Glu Leu Lys 623 

ATA ATG ACT CAC CTG GGG CCA CAT TTG AAC ATT GTA AAC TTG CTG GGA GCC 2117 
He Met Thr His Leu Gly Pro His Leu Asn He Val Asn Leu Leu Gly Ala 640 

TGC ACC AAG TCA GGC CCC ATT TAC ATC ATC ACA GAG TAT TGC TTC TAT GGA 2168 
cys Thr Lys Ser Gly Pro He Tyr He He Thr Glu Tyr cys Phe Tyr Gly 657 

GAT TTG GTC AAC TAT TTG CAT AAG AAT AGG GAT AGC TTC CTG AGC CAC CAC 2219 
Asp Leu Val Asn Tyr Leu His Lys Asn Arg Asp Ser Phe Leu Ser His His 674 

CCA GAG AAG CCA AAG AAA GAG CTG GAT ATC TTT GGA TTG AAC CCT GCT GAT 2270 
Pro Glu Lys Pro Lys Lys Glu Leu Asp He Phe Gly Leu Asn Pro Ala Asp 691 

GAA AGC ACA CGG AGC TAT GTT ATT TTA TCT TTT GAA AAC AAT GGT GAC TAC 2321 
Glu Ser Thr Arg Ser Tyr Val He Leu Ser Phe Glu Asn Asn Gly Asp Tyr 708 

ATG GAC ATG AAG CAG GCT GAT ACT ACA CAG TAT GTC CCC ATG CTA GAA AGG 2372 
Met Asp Met Lys Gin Ala Asp Thr Thr Gin Tyr Val Pro Met Leu Glu Arg 725 

AAA GAG GTT TCT AAA TAT TCC GAC ATC CAG AGA TCA CTC TAT GAT CGT CCA 2423 
Lys Glu Val Ser Lys Tyr Ser Asp He Gin Arg Ser Leu Tyr Asp Arg Pro 742 

GCC TCA TAT AAG AAG AAA TCT ATG TTA GAC TCA GAA GTC AAA AAC CTC CTT 2474 
Ala Ser Tyr Lys Lys Lys Ser Met Leu Asp Ser Glu Val Lys Asn Leu Leu 759 

TCA GAT GAT AAC TCA GAA GGC CTT ACT TTA TTG GAT TTG TTG AGC TTG ACC 2525 
Ser Asp Asp Asn Ser Glu Gly Leu Thr Leu Leu Asp Leu Leu Ser Phe Thr 776 

TAT CAA GTT GCC CGA GGA ATG GAG TTT TTG GCT TCA AAA AAT TGT GTC CAC 2576 
T^yr Gin Val Ala Arg Gly Met Glu Phe Leu Ala Ser Lys Asn Cys Val Hzs 793 

CGT GAT CTG GCT GCT CGC AAC GTT CTC CTG GCA CAA GGA AAA ATT GTG AAG 2627 
Arg Asp Leu Ala Ala Arg Asn Val Leu Leu Ala Gin Gly Lys He Val Lys 810 
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ATC TGT GAC TTT GGC CTG GCC AGA GAC ATC ATG CAT GAT TCG AAC TAT GTG 2678 
He cys Asp Phe Gly Leu Ala Arg Asp He Met His Asp Ser Asn Tyr Val 827 

TCG AAA GGC AGT ACC TTT CTG CCC GTG AAG TGG ATG GCT CCT GAG AGC ATC 2729 
Ser Lys Gly Ser Thr Phe Leu Pro Val Lys Trp Met Ala Pro Glu Ser -lie 844 

TTT GAC AAC CTC TAC ACC ACA CTG AGT GAT GTC TGG TCT TAT GGC ATT CTG 2780 
Phe Asp Asn Leu Tyr Thr Thr Leu Ser Asp Val Trp Ser Tyr Gly He Leu 861 

CTC TGG GAG ATC TTT TCC CTT GGT GGC ACC CCT TAC CCC GGC ATG ATG GTG 2831 
Leu Trp Glu He Phe Ser Leu Gly Gly Thr Pro Tyr Pro Gly Met Met Val 878 

GAT TCT ACT TTC TAC AAT AAG ATC AAG AGT GGG TAC CGG ATG GCC AAG CCT 2882 
Asp Ser Thr Phe Tyr Asn Lys He Lys Ser Gly Tyr Arg Met Ala Lys Pro 895 

GAC CAC GCT ACC AGT GAA GTC TAC GAG ATC ATG GTG AAA TGC TGG AAC AGT 2933 
Asp His Ala Thr Ser Glu Val Tyr Glu He Met Val Lys cys Trp Asn Ser 912 

GAG CCG GAG AAG AGA CCC TCC TTT TAC CAC CTG AGT GAG ATT GTG GAG AAT 2984 
Glu Pro Glu Lys Arg Pro Ser Phe Tyr His Leu Ser Glu He Val Glu Asn 929 

CTG CTG CCT GGA CAA TAT AAA AAG AGT TAT GAA AAA ATT CAC CTG GAC TTC 3035 
Leu Leu Pro Gly Gin Tyr Lys Lys Ser Tyr Glu Lys He His Leu Asp Phe 946 

CTG AAG AGT GAC CAT CCT GCT GTG GCA CGC ATG CGT GTG GAC TCA GAC AAT 3086 
Leu Lys Ser Asp His Pro Ala Val Ala Arg Met Arg Val Asp Ser Asp Asn 963 

GCA TAC ATT GGT GTC ACC TAC AAA AAC GAG GAA GAC AAG CTG AAG GAC TGG 3137 
Ala Tyr lie Gly Val Thr Tyr Lys Asn Glu Glu Asp Lys Leu Lys Asp Trp 980 

GAG GGT GGT CTG GAT GAG CAG AGA CTG AGC GCT GAC . AGT GGC TAC ATC ATT 3188 
Glu Gly Gly Leu Asp Glu Gin Arg Leu Ser Ala Asp Ser Gly Tyr He He 997 

CCT CTG CCT GAC ATT GAC CCT GTC CCT GAG GAG GAG GAC CTG GGC AAG AGG 3239 

Pro Leu Pro Asp He Asp Pro Val Pro Glu Glu Glu Asp Leu Gly Lys Arg 1014 

AAC AGA CAC AGC TCG CAG ACC TCT GAA GAG AGT GCC ATT GAG ACG GGT TCC 3290 
Asn Arg His Ser Ser Gin Thr Ser Glu Glu Ser Ala He Glu Thr Gly Ser 1031 

AGC AGT TCC ACC TTC ATC AAG AGA GAG GAC GAG ACC ATT GAA GAC ATC GAC 3341 
Ser Ser Ser Thr Phe He Lys Arg Glu Asp Glu Thr He Glu Asp He Asp 1048 

ATG ATG GAC GAC ATC GGC ATA GAC TCT TCA GAC CTG GTG GAA GAC AGC TTC 33 92 
Met Met Asp Asp He Gly He Asp Ser Ser Asp Leu Val Glu Asp Ser Phe 1065 

CTG TAACTGGCGGATTCGAGGGGTTCCTTCCACTTCTGGGGCCACCTCTGGATCCCGTTCAGAAAA 345B 
Leu 1066 

CCACTTTATTGCAATGCGGAGGTTGAGAGGAGGACTTGGTTGATGTTTAAAGAGAAGTTCCCAGCCA 3525 
" AGGGCCTCGGGGAGCCTTTCTAAATATGAATGAATC^ 3592 
CCTCTTGCAATGCCTCAGTAGCATCTC^GTGGTGTGTGAAGTTTGGAGATAGATGGATAAGGGAATA 3659 
ATAGGCCACAGAAGGTGAACTTTCTGCTTCAAGGACATTGGTGAGAGTCCAACAGACACAATTTATA 3726 
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CTGCGACAGAACTTCAGCATTGTAATTATGTAAATAACTCTAAC^ 
AACTATOTCCTTGGACTTCTG^ 

TGTCAGCTG^TOTTGAACTTTTTAAAGAAGTGCRTGAAAAACCAT^ 

TA^ATAG^TTraC^C^^ Jg* 




ACTACTGTTATCAGTAATGCTAAATGTG'l!AATAA'i.-UTAAlJAX\»AXl l v->» i , 1QC - 

tiaw^^^actaagtaggtgatgagtt^^ "If 

CTCAATGTAGAGGCATAAACCTGTGCTGAACATAACTTCTCATGTATATTACC^^ *W 
ATGA^^GCGCANAAAGACnXSGATTrGCAGAAGTTNlTriTrri'X'l 1 ^^^^^^^E^^i^pAg Jg? 

TTTOTCGACCCCAATATATGTATTTTrTGA^^ JgJ 

A^TOGCCTCOTCTTTCACCCOTACCCCAAAGAG^ *«•* 

SrOTTAGTGGAGGCrOSAA^ "" 

tIgSgtacct*^ "I! 

TATTmCTAAACTCCCT^CrGrrCTGATCGGCCAGGTTTC^ 

ttTAAATTTAGGAACCGAAGTCCAATCAC^ 4M9 

AATTGAXAGGT^AATCGACTCCGACTCrmCGATTTC^ 5066 

TTCTACGATGAAGGGTGACATACCCCCTCTAACTTGAAAGGGCCAGAGGGCAG^^ 5133 

AGGTATGGGGCGGTTCCTrTCCGTACATGT^ 5200 

^Sacaaaaccaccttt™ 

ACTTAA^GTGTAGATAGGTGTGACAGTTTCTCCAACCACACCCAAGTAAC^ f.lt 

ACGAAT^rAACGACTATGGTATACTTACTTTGTACCCGACACTAATGACGTTA |401 

TCTACTAGJAAACCTTCTACGTCTOroTTATTATTTCA 5468 

S^ttSgaataggttgaaaaactat^ |g* 

rTOCTOTGGACTGATCCAAGACATCTCGGTTAATCTGAAOT 5602 

TCTCGAGTTCGTAAGACAAATAGCGAGTGAGAGGGAACATGTC^ 5669 

AAACTATAACGACACTCGGAACGTACTGTAGTACTCCGGCCTACTTTGAAGAGT^GGTC 5736 

GTCAKATTGTTTACGAGGGTGGACTTAAACATATACTGACGTAA^ 

. CGTTTAAGGTCTAAACAAAGGAAAACCGGAGGACCT g£ 



CGTTTAAGGTCTAAACAAACKiAAAAUUl^A^AL.L. i. r „: : I " -p^ > „ » 1 &TTTGC 5937 
GATGAAAGATAAAAATACTACTGTTAGTTTCGGCCGGACTCTTTCTGAT 5937 
TAATCACTACAGGAATTTTACACCAGACGGTTAGAC^TG^ 

gtat^tWctacaatatgtagttatacatatatacataaagatatw mi 
Sgttttgtaaatactgttcgacatagtgacggaagcaaatataaaa^ llll 

GTCTGTGTAATTGACAACGTGAAAACTTACAGGTTTTAAATATAAAATCTTTATT^ "OS 



CTATCTTTTACATAATGCTTACGGGGACAAGTACAAAAACAA "39 
TATAAAGTTATTTACTATATATTAAATTTCCTTAAG 
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A polypeptide or nucleic acid is substantially pure, 
or substantially purified, when it comprises at least about 30% 
of the respective polymer in a composition, typically at least 
about 50%, more typically at m least about 70%, usually at least 
about 80%, more usually at least about 90%, preferably at least 
about 95%, and more preferably about 98% or more. 

The soluble fragments of the extracellular region 
will generally be less than about 400 amino acids, usually less 
than about 350 amino acids, more usually less than about 300 
amino acids, typically less than about 200 amino acids, and 
preferably less than about 150 amino acids. 

A. D Domains 

Based on a number of observations, the extracellular 
region (XR) of these PDGF receptor polypeptides comprises 5 
immunoglobulin-like domains. First, the amino acid sequence 
contains 5 segments characteristic of Ig-like domain 
structures, each of the segments having an appropriate size for 
an immunoglobulin domain. Each segment, except for the fourth, 
has characteristically spaced cysteine residues that are a 
diagnostic feature of an immunoglobul in-like domain. The 
receptor polypeptide sequence displays other features of 
immunoglobul in-like domain structure, e.g., the presence of 
characteristically positioned tryptophan and tyrosine residues. 
Direct sequence comparisons of segments of the receptor 
polypeptides with corresponding segments of true immunoglobulin 
domains shows a statistically significant similarity between 
PDGF receptor polypeptide domains and immunoglobulin domains . 
See, e.g., Williams (1989) Science 243: 1564-1570. The 
argument that the receptor polypeptide domains assume the 
folding pattern of immunoglobulin domains can be strengthened 
by examining the predicted secondary structure of the receptor 
polypeptides. 

When a homology mapping analysis is performed, the 
PDGF receptor polypeptide shows five Ig-like domains in the 
extracellular region, each domain showing statistically 
significant homology to defined Ig-like domains, see, e.g., 
Williams and Barclay (1988) Ann. Rev. Immunol: Biochem. 6: 381- 
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405. Regions of homology will show significant sequence 
homology to particular Ig-like domains, and exhibit particular 
secondary and tertiary structural motifs characteristic of Ig- 
like domains. The domain structures will preferably be those 
5 segments with boundaries which approximately match the 
boundaries of the domain structures. The boundaries will 
preferably match within about 9 amino acids, typically within 
about 7 amino acids, more typically within about 5 amino acids, 
usually within about 3 amino acids, and more usually within 1 

10 amino acid. See, e.g.. Cantor and Schimmel (1980) Biophysical 
Chemistry . Vols I-III, Freeman and Co., San Francisco; 
Creighton (1984) Proteins: Structur e and Molecular Properties, 
Freeman and Co., New York; and Watson et al. (1987) The 
Molecular Bioloov of the Gene . Vols 1 and 2, Benjamin, Menlo 

15 Park, California; each of which is hereby incorporated herein 
by reference. 

The sequences of the human type B and the human type 
A receptor polypeptides can be analyzed to predict their beta 
strand topology. Combining a Fourier analysis of hydrophobic 

20 sequence pattern and a Garnier-Robson algorithm, see, e.g., 
Garnier et al. (1978) J. Mol. Biol. 120: 97, with a turn 
predictor program, as reported in Cohen et al. (1986) 
Biochemistry 25: 266, produces a characteristic structural 
pattern. This pattern exhibits consensus 0 -strand segments in 

25 each domain when analysed as described. 

. The first two Ig-like domains of the PDGF receptor 
polypeptides, Dl and D2, have about seven 0-strand segments, 
designated the A, B, C, D, E, F, and G segments, as listed from 
amino proximal to carboxy proximal direction. The third, 

30 fourth and fifth Ig-like domains, D3, D4 and D5, are long 
enough to include an extra 0-strand segment, designated C. 
The fifth domain, D5, most closely resembles a variable heavy 
chain domain in length. The type B receptor polypeptide D5 
further comprises an additional £-strand segment designated C". 

35 These features and designations are based partly on the 

homology of segments between domains and segments in the type B 
and type A hPDGF-R polypeptides, and with the mouse type B PDGF 
receptor polypeptide, and also based upon homology to other Ig- 
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like segments found on other proteins, particularly other 
growth factor receptor proteins. The csf-1 receptor and c-kit 
proto-oncogene have similar Ig-like domain" organizations. See, 
e.g., Williams (1989) Science 243:1564-1570. 
5 The domain structure is based, in part, upon features 

common to Ig-like domains found in other proteins, including 
related receptors. See, e.g.; Ullrich and Schlessinger (1990) 
Cell 61:203-212; and Yarden and Ullrich (1988) Ann," Rev. 
Biochem. 57:443-78. The domain boundaries for the two alleles 
10 disclosed herein are identified below, but different alleles 
may have slightly different positions for the boundaries. See 
Table 14. 

The Ig-like domains (D domains) are characterized by 
the regularity of spacing of cysteine residues in the 

15 extracellular region. These five D domains, each about 100 

amino acids in length, have /3-sheet rich structures, resembling 
immunoglobulin variable or constant regions. See, Williams 
(1989) Science 243:1964-1570. The natural XR domains are 
numbered from the amino proximal domain Dl, in order, through 

20 D5, at the carboxy proximal end of the XR. 

The exon structure of the mouse type B PDGF receptor 
polypeptide gene also matches this domain structure with 
reasonable fidelity. The correlation between the intron-exon 
structure and functional units further supports the hypothesis 

25 that the boundaries define functional units of the polypeptide. 
See, e.g., Williams and Barclay (1988) Ann. Rev. Immunol. 
Biochem. 6:381-405. The boundaries for each of these segments 
are indicated below for the two alleles disclosed herein, and 
similar boundaries will be found in other alleles at locations 

30 of sequence and functional homology. 

The amino-proximal Ig-like domain of the human 
platelet-derived growth factor receptor polypeptides is 
designated Dl. The Dl domain extends from about leu(l) to 
pro (91) in the type B receptor polypeptide, and from about 

35 gln(l) to pro (101) in the type A receptor polypeptide. See 
Table 14. The Dl domain apparently has about seven 0-sheet 
segments. 
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The next Ig-like domain , in the carboxy proximal 
direction of natural human platelet-derived growth factor 
receptor polypeptides r is designated D2. The D2 domain extends 
from about thr(92) to ser(181) in the type B receptor 
5 polypeptide, and from about asp (102) to ser(189) in the type A 
receptor polypeptide. The D2 domain apparently also has about 
seven p -sheet strands designated A, B, C, D, E/ *% and G. 

The third Ig-like domain found on natural human PDGF 
receptor polypeptides is designated D3. The D3 domain extends 

10 from about ile(l82) to gly(282) in the type B receptor 

polypeptide, and from about glu(190) to gly(290) in the type A 
receptor polypeptide. The D3 domain apparently has about eight 
0-sheet strands designated A r B, C, C, D r E, F, and G. 

The fourth Ig-like domain found in the natural human 

15 PDGF receptor polypeptides is designated D4. The D4 domain 

extends from about tyr(283) to pro (3 84) in the type B receptor 
polypeptide, and from about phe(291) to pro (391) in the type A 
receptor polypeptide. The D4 domain apparently has about eight 
P -sheet strands. Note that the D4 domains lack the 

20 characteristic cysteine residues, which correspond to val(306) 
and met (364) in the type B sequence shown, and to val(313) and 
ile(371) in the type A sequence shown.. 

The fifth Ig-like domain is designated D5. The D5 
domain extends from about val(385) to lys(499) in the type B 

25 receptor polypeptide, and from about ser(392) to glu(501) in 

the type A receptor polypeptide. The D5 of the type B receptor 
polypeptide has about nine putative 0-sheet strand segments 
designated A, B, C, C», C", D, E, F, and G, while the type A 
receptor polypeptide has only about eight 0-strand segments, 

30 lacking a C" segment. 

The approximate boundaries of the domains and 0- 
strand segments are listed in Table 14. The apparent 
alignments of the segments are illustrated in Tables 4 and 5. 
Other alleles of the receptor polypeptides may also be analyzed 

35 by either homology or the structural analysis as described 
above. 
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The prototypical Dl domains are those sequences of 
the human type B receptor polypeptide and the human type A 
receptor polypeptide, as described. However, compatible amino 
acid substitutions, insertions, and deletions which preserve 
S the desired ligand binding functions can be made. The function 
will usually be preserved by retaining the LBR segments in the 
correct orientation by use of appropriate structured segments . 
Conservative substitutions typically include substitutions 
within the following groups: glycine, alanine? valine, 
10 isoleucine, leucine? aspartic acid, glutamic acid; asparagine, 
glutamine? serine, threonine? lysine, arginine? and 
phenylalanine, tyrosine. Substitution or exchange of 0-sheet 
segments or sequences intermediate the segments from different 
domains may be performed, including between type B and A 
15 receptor polypeptides, or between different domains of another 
related receptor polypeptide. Segments outside the 
prototypical cysteines within 0-segments B and F (but val(306) 
and met(364) in the type B D4, and val(313) and ile(371) in the 
type A D4) will be usually less critical than the sequences 
20 between those residues, e.g., the C, C\ C», D? and E /?-strand 
segments. Also, segments homologous to these disclosed 
segments may be substituted, including those with compatible 
amino acid substitutions, insertions, and deletions. Sources 
of similar domains and segments include related receptor 
25 polypeptides from human or other mammalian species. Non- 
mammalian receptor polypeptides may also exhibit significant 
homology and serve as sources for similar segments. Other Ig- 
like domains and segments may also be substituted. 

The present invention embraces polypeptides which 
30 exhibit homology to the disclosed and described segments and 
domains. It embraces segments comprising contiguous amino 
acids of the sequences disclosed, . typically at least about 8 
contiguous amino acids, more typically at least about 11 
contiguous amino acids, usually at least about 14 contiguous 
35 amino acids, more usually at least about 17 contiguous amino 
acids, and preferably at least about 21 or more contiguous 
amino acids, constructs retaining the LBR segments are most 
valuable. The invention also includes modifications of those 
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sequences, including insertions, deletions, and substitutions 
with other amino acids. Glycosylation modifications, either 
changed, increased amounts, or decreased amounts, as well as 
other sequence modifications are envisioned. Thus, the 
5 modified proteins comprising these amino acid sequences, e.g., 
analogues, will usually be substantially equivalent to these 
proteins in either function or structure. 

The /9-sheet strands may be slightly enlarged or 
shortened by respective insertions or deletions in the 

10 polypeptide sequence. Thus, certain embodiments will have a 
slightly enlarged or shortened particular domain by adding or 
deleting particular sequences of 0-sheet strands or their 
inter-strand sequences. Segments may be inserted or deleted 
which conform to the structural requirements of retaining the 

15 proper intra- and intexr-domain interactions. In particular, 

changes which interrupt the secondary and tertiary structure of 
the protein will be disfavored. See, e.g., Cantor and Schimmel 
(1990) and Creighton (1984). In addition, amino acids or 
segments may be inserted or deleted in the regions outside of 

20 the 0-sheet strands and between domains. Typically the 

substitutions will be of amino acids having similar properties, 
and additions or deletions would preferably be selected among 
those which retain receptor biological functions, e.g., ligand 
binding. 

25 The sequence of a J3- sheet segment will typically not 

differ from a sequence from a human type B polypeptide or a 
human type A polypeptide by greater than about 50%, more 
typically less than about 39%, usually less than about 29%, and 
more usually less than about 20%. Comparable similarities over 

30 each of the non-0-sheet strands of each domain will be 
preferred. 

The boundaries between domains are defined, in part, 
by the definitions for domains in the I g- like domains. 
Examples of similar domains are found in immunoglobulin and 
35 growth factor receptor polypeptides. The domain boundaries 
between Dl and D2; D2 and D3 ; D3 and D4? and D4 and D5 
correspond approximately to exon locations, further supporting 
the proposal that the domain structures correspond to 



• 



WO 92/13867 PCT/US92/0O730 

34 

evolutional and functional units. See, e.g., Watson et al.. 
(1987) The Molecular Biolo gry of the Gene, vols. 1 and 2, 
Benjamin, Menlo Park f California. 

The D2 domains have similar characteristics to the Dl 
domains, as shown by the alignments illustrated in Tables 4 and 
5. Both domains have 0-sheet segments designated A, B, C, D, 
E, F, and G. The domain 3 segments, or D3, also exhibit 
homology, but have an additional 0-strand segment designated 
C . The D4 segments, or D4, have non-cysteine residues at the 
positions which typically correspond to cysteines in the other 
domains. In the type B allele shown, the residues are val(306) 
and met (3 64), while in the type A allele shown, the residues 
are val(313) and ile(371). The D4 domains also have 0-strand 
segments designated C f . The domain 5, or D5, have the 
consensus cysteine residues and the additional C» 0-strand 
segments, and the type B receptor polypeptide has an additional 
C" £-strand segment. 

The present invention provides for various constructs 
comprising ligand binding constructs, typically comprising 
substantially intact domains. These constructs will have 
various uses, e.g., for binding ligands, or substituting for 
intact receptor polypeptides. For example, each of the 
separate domains may comprise a separate polypeptide alone, or 
may be fused to another peptide, such as the TM and IR regions 
of a receptor polypeptide, e.g., hPDGF-R. See, e.g., Table 6. 
These individual single domain polypeptides will exhibit 
specific activity associated with these specific domains, 
preferably as an agonist or antagonist for ligand binding, 
preferably with characteristics shared with the intact receptor 
polypeptide or XR. The domains may also preferably serve as 
competitive inhibitors of PDGF-R polypeptides, competing with 
natural PDGF-receptors to bind ligands. The present invention 
also provides repetitive sequences of a single domain. For 
example, a Dl domain by itself is provided, a Dl-Dl dimer in a 
single polypeptide is provided, a D1-D1-D1 triplet repeat is 
also provided. Likewise up to a large number of Dl domains 
which will exhibit many functions, e.g., immunological 
properties, characteristic of various natural PDGF-R sequences. 
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Similar constructs of each of D2, D3, D4 , and D5 are provided, 
along with combinations. See Tables 6, 7, 8, 9 and 10- These 
will often be soluble fragments of the XR, or may be fused to 
other polypeptides, including a PDGF-R TM segment, preferably 
5 with an IR segment also. 
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TABLE 6 

XR domain structure of single domain forms 
Dl D3 D4 D5 



10 TABLE 7 

XR domain structure of two domain forms 

Dl-Dl D2-D1 D3-D1 D4-D1 D5-D1 

15 D1-D2 D2-D2 D3-D2 D4-D2 D5-D2 

D1-D3 D2-D3 D3-D3 D4-D3 D5-D3 

D1-D4 D2-D4 D3-D4 D4-D4 D5-D4 

D1-D5 D2-D5 D3-D5 D4-D5 D5-D5 



20 



25 



30 



45 



TABLE 8 

XR domain structure of three domain forms 

Dl-W D2-W D3-W D4-W D5-W 

where W is each of the 25 possible combinations listed in 
TABLE 2, giving a total of 125 elements in this table 



TABLE 9 

35 XR domain structure of four domain forms 

Dl-X D2-X D3-X D4-X D5-X 

where X is each of the 125 possible combinations 
40 listed in TABLE 5, giving a total of 625 elements in 

this table 



TABLE 10 

XR domain structure of five domain forms 

Dl-Y D2-Y D3-Y D4-Y D5-Y 

50 where Y is each of the 625 possible combinations 

listed in TABLE 6, but not including the combination 

D1-D2-D3-D4-D5, giving a total of 3124 elements in 
this table 



55 
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In addition, the present invention provides similar 
structures with spacer regions between the domain structures . 
In particular, the regions- corresponding to the intra-cysteine 
residues of the domains shown in Tables 4 and 5 are useful. 
5 For example, a spacer polypeptide may be inserted between 
adjacent domains or do spaces between the important ligand 
binding segments, typically found within the intra-cysteine 
segments described, e.g., the B, C, C», C", D, E, and F 0- 
strand segments. Thus, for example, a polypeptide of the 
10 structure D1-X1-D2 is provided where XI is a spacer segment 
which is not a D domain. The order of the domains may be 
reversed, and the invention also provides polypeptides such as 
D2-D1, or D2-X1-D1. In particular, the non-D domain character 
of XI is provided to avoid the peptide D1-X1-D3 from 
15 describing, or encompassing, D1-D2-D3. 

Another particularly preferred embodiment of the 
invention is a polypeptide having the described extracellular 
region domain structure combined with other segments of a human 
platelet-derived growth factor receptor, particularly the 
20 transmembrane segment (TM) and the intracellular region (IR) . 

Thus, the present invention provides for a receptor polypeptide 
which either has a modified order of the extracellular region 
domains in the amino to carboxy direction, e.g., a D5-D4-D3-D2- 
Dl-TM-IR polypeptide, or, in some cases reversal of various 
25 domains. It also provides for a receptor polypeptide with a 
deleted intact domain and for a receptor polypeptide having an 
additional domain added to it. Examples include D1-D2-D3-TM- 
IR, or D1-D2-D3-D4-TM-IR. In particular, fusions with the XR 
segments described in Tables 6, 7, 8, 9, and 10 are preferred 
30 embodiments. 

The modified combinations of the D domains are 
expected to both simulate and differ from the natural receptor. 
The modified polypeptide would be expected, in some 
embodiments, to exhibit a modified binding affinity, e.g., 
35 higher or lower affinity, or to exhibit a different spectrum of 
binding to different ligands or ligand analogues. They may 
also have an altered ligand binding transducing efficiency, or 
a modified inter-chain- association affinity. 
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The present invention provides the means for 
determining the minimal structural features necessary to 
perform various functions of the extracellular region of 
platelet-derived growth factor receptors, preferably human 
5 receptors. Although similar determinations may be performed in 
mouse or other mammalian species, the human receptor will 
typically be preferred for diagnostic or therapeutic purposes. 

To determine the minimal region necessary for a 
functional activity, e.g., ligand binding, an assay for that 
10 activity is developed. The main receptor functions, as 
indicated above, include ligand binding, tyrosine kinase 
activity, and receptor dimerization. Simple and quick assays 
for each of these molecular functions may be developed. Ligand 
binding assays are described, e.g., in Gronwald et al. (1988) 
15 P™r.. Nat'l * ~,d. Sci. USA 85:3435-3439; Heldin et al. (1988) 
EMBO J. 7:1387-1393; and Escobedo et al. (1988) Science 
240:1532-1534. Receptor dimerization assays are described, 
e.g., in Yarden and Schlessinger (1987) Biochemistry 26:1434- 
1442 and 1443-1451. 
20 As an alternative means for determining sites which 

interact with specific other proteins, physical structure 
determination, e.g. f x-ray crystallography or 2 dimensional NMR 
techniques, will provide guidance as to which amino acid 
residues form the molecular contact regions. For a detailed 
25 description of protein structural determination, see, e.g., 

Blundell and Johnson (1976) Protein cr ystallography. Academic 
Press, New York, which is hereby incorporated herein by 
reference. 

Ligand binding assays may include binding of labeled 
30 ligand or competition assays for binding. Signal transduction 
may be indirectly assayed by measuring an activity modulated by 
ligand binding, e.g., tyrosine kinase activity, or some measure 
of a conformational or other change in receptor structure. For 
example, an antibody or other binding protein which 
35 specifically binds or dissociaters from the receptor polypeptide 
upon ligand binding may be used. Receptor dimerization may be 
measured by a proximity assay, including a fluorescence 
quenching or other spectroscopic measurement. Various 
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proximity assays are known, see, e.g., Ullrich and Schlessinger 
(1990) Cell 61:203-212; Yarden and Schlessinger (1987) 
Biochemistry 26:1434-1942 and 1443-1451; each of which is 
hereby incorporated herein by reference. 
5 Once an assay has been developed, various 

combinations of domain or other segments , e.g., LBR's, can be 
tested for affecting that activity. A competitive inhibition 
assay will detect those constructs which can bind the ligand. 
The first domain structures to try will "ordinarily be the 

10 individual domains, either alone or linked to chimeric proteins 
or the TM-IR segment of the receptor. Various alleles, 
modifications to the individual domains, or related chimeric 
domains would be tested. Both deletion and chimeric proteins 
will be constructed. 

15 Various combinations of each domain will be 

constructed and tested to select those which affect the 
measured activity. Repeats of those domains should be tested, 
e.g,., Dl-Dl. If no single domain does affect the function, 
then various 2 domain constructs, in order, would be tried, 

20 e.g., D1-D2 -TM-IR, D2-D3 -TM-IR, D3-D4 -TM-IR, and D4-D5-TM-IR. 
Selected combinations listed in Tables 6, 7, 8, 9, and 10 will 
be constructed and tested. 

In order to produce soluble forms, it will often be 
desireable to attach appropriate amino terminal segments,, some 

25 of which would be expected to be present in the Dl domain or in 
the precursor form. Correct secretion and processing may be 
dependent upon various amino proximal features, such as signal 
sequences, and other features essential for correct targeting 
and processing. See, e.g., Watson et al. (1987) The Molecular 

30 Biology of the Gene, vols* 1 and 2., Benjamin, Menlo Park, 
California. 

When correct domains have been selected which are 
especially effective in modulating or competing defined 
functions, a more detailed analysis, to the level of the 
35 strand segments might be addressed. Various chimeric, 

deletion, insertion, or substitution constructs of each 0- 
strand or inter-strand segment may be generated and tested, as 
described above. Each construct could be produced using 
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methods of standard genetic engineering, especially using 
synthetic primers. Procedures for using such reagents are 
described, e.g., in Sambrook, et al. (1989) MoTer-nlar Cloning: 
K r^^ nr » Manual, vols. 1-3, Cold Spring Harbor Press, and 
Ausubel et al. (eds.) (1989) Current Prororols in Molecular 
Biolsay, Wiley, each- of which is hereby incorporated herein by 
reference. 

B. Soluble Forms 

In some embodiments, only the extracellular region is 
provided. Thus, the extracellular region alone, without the 
transmembrane segment, will often be a soluble polypeptide. It 
has been demonstrated that the entire extracellular region, 
separated from, and which lacks a transmembrane region and an 
intracellular region, still serves as a ligand binding 
polypeptide. In particular, the soluble polypeptide D1-D2-D3- 
D4-D5 has been demonstrated to bind various PDGF forms. 
Although the binding specificity for the PDGF form is 
dependent, to some extent, on the specific domains included, 
modifications to the specificity of the ligand binding may be 
effected by either substituting various different domains or 
rearranging the domains. Substitution with other homologous 
segments may also be performed, e.g., substituting an Ig-like 
domain from an antibody molecule, such as an antibody which 
binds a platelet-derived growth factor. Alternatively, a 
domain from a different related growth factor or ligand 
receptor may be substituted, e.g., from an FGF receptor or 
another PDGF receptor. The order of the domains may also be 
modified, e.g., D5-D4-D3-D2-D1. 

In particular, the activities which will usually be 
of greatest importance with the extracellular constructs relate 
to the binding of the ligand. For example, it has been 
discovered that domains D4 and D5 are not essential for ligand 
binding of a soluble extracellular region PDGF-R polypeptide. 
Of the remaining domains, if domain D3 is separated from 
domains Dl and D2, the construct D1-D2 binds the ligand only at 
low affinity, but a D1-D2-D3 construct binds ligand at high 
affinity. 
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A typical hPDGF-R nucleic acid sequence encodes a 
transitory amino terminal hydrophobic sequence, which is 
usually cleaved during the membrane translocation process. The 
classical function of a signal sequence is to direct the 
nascent polypeptide chain to membtane bound ribosomes, thereby 
leading to membrane translocation or cellular targeting. 
However, since the signal sequence is typically removed in the 
translocation process, the signal sequence is usually absent in 
a mature polypeptide. Often a signal sequence will be attached 
upstream of a desired soluble peptide of this invention. 

Solubility of a polypeptide depends upon the 
environment and the polypeptide. Many parameters affect 
polypeptide solubility, including the temperature, the 
electrolyte environment, the size and molecular characteristics 
of the polypeptide, .and the nature of the solvent. Typically, 
the temperature at which the polypeptide is used ranges from 
about 4*C to about 65 *C. Usually the temperature at use is 
greater than about 18 *C and more usually greater than about 
22'C. For diagnostic purposes, the temperature will usually be 
about room temperature or warmer, but less than the 
denaturation temperature of components in the assay. For 
therapeutic purposes, the temperature will usually be body 
temperature, typically about 37 # C for humans, though under 
certain situations the temperature may be raised or lowered in 
situ or in vitro. 

The electrolytes will usually approximate in situ 
physiological conditions, but may be modified to higher or 
lower ionic strength where advantageous . The actual ions may 
be modified to conform to standard buffers used in 
physiological or analytical contexts. 

The size and structure of the polypeptide should be 
in a substantially stable and globular state, and usually not 
in a denatured state. The polypeptide may be associated with 
other polypeptides in a quaternary structure, e.g., to confer 
solubility. 

The solvent will usually be a biologically compatible 
buffer, of a type used for preservation of biological 
activities, and will usually approximate a physiological 
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solvent. On some occasions, a detergent will be added, 
typically a mild non-denaturing one. 

Solubility is usually measured in Svedberg units, 
which are a measure of the sedimentation velocity of a molecule 
5 under particular conditions. The determination of the 
sedimentation velocity was classically performed in an 
analytical ultracentrifuge, but is typically now performed in a 
standard ultracentrifuge. See, Freif elder (1982) Physical 
Biochemistry (2d ed. ) , W.H. Freeman, and Cantor and Schimmel 
0 (1980) »*"P*Y^»1 Chemistry, parts 1-3, W.H. Freeman & Co., 
San Francisco, each of which is hereby incorporated herein by 
reference. As a crude determination, a sample containing a 
"soluble" polypeptide is spun in a standard full sized 
ultracentrifuge at about 5 OK rpm for about 10 minutes, and 
3 soluble molecules will remain in the supernatant. A soluble 
particle or polypeptide will typically be less than about 30S, 
more typically less than about 15S, usually less than about 
10S, more usually less than about 6S, and, in particular 
embodiments, preferably less than about 4S, and more preferably 
20 less than about 3S. 

This invention provides platelet-derived growth 
factor polypeptides and proteins having platelet-derived growth 
. factor receptor ligand binding activity. The receptors of the 
present invention include PDGF receptor amino acid sequences 
25 such as those shown in Tables 6, 7, 8, 9, and 10. Also 

provided are homologous sequences, allelic variations, induced 
mutants, alternatively expressed variants, and proteins encoded 
by DNA which hybridize under high stringency conditions to PDGF 
receptor encoding nucleic acids retrieved from naturally 
30 occurring material. 

The platelet-derived growth factor receptor peptides 
of the present invention will exhibit at least about 80% 
homology with naturally occurring domains of hPDGF receptor 
sequences in the domains Dl, D2, D3, D4, and D5, typically at 
35 least about 85% homology with a natural form of a receptor 

sequence, more typically at least about 90% homology, usually 
at least about 95% homology, and more usually at least about 
97% homology. 
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Homology, for polypeptides, is typically measured 
using sequence analysis software, see, e.g., Sequence Analysis 
Software Package of the Genetics Computer Group, University of 
Wisconsin Biotechnology Center, 1710 University Avenue, 
5 Madison, Wisconsin 53705. Protein analysis software matches 
similar sequences using measure of homology assigned to various 
substitutions, deletions, substitutions, and other 
modifications, similar, or homologous, substitutions for LBR 
segments will be made in known sequences, thereby producing new 
10 binding molecules having modified affinity or specificity of 
ligand binding. 

Various other software analysis programs can analyze 
the conformational structure of a polypeptide. Homologous 
conformation may also be achieved by appropriate insertion, 
15 deletion, substitution, or modification of amino acid 

sequences. Since the conformational structure of the domains 
and 0-strand segments is only partially understood, the present 
invention also encompasses various modifications to the 
sequences disclosed and retaining these structural features. 
20 In particular, ligand binding function is believed to 

be localized to the extracellular domain, particularly the 
LBR 1 s , and the soluble forms will preferably retain this 
particular function. Soluble fragments of PDGF receptors will 
be useful in substituting for or for interfering with, e.g., 
25 blocking, by competing for PDGF binding, the functions of the 
natural receptor both in vitro and in vivo . Alternatively, 
soluble forms may interfere with the dimerization of PDGF 
receptor polypeptides, since the proteins may normally be in, 
or function in, a dimer form. Receptor dimerization may be 
30 essential for proper physiological signal transduction, and 
v introduction of fragments may function to interrupt these 
processes by blocking their dimerization. 

PDGF receptor polypeptides may be purified using 
techniques of classical protein chemistry, see, e.g., Deutscher 
35 (ed.) (1990) Guide to Purification ; Methods in Enzymology, Vol. 
182, which is hereby incorporated herein by reference. 
Alternatively, a lectin affinity chromatography step may be 
used, or a highly specific ligand affinity chromatography 
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procedure, e.g., one that utilizes a PDGF conjugated to biotin 
through cysteine residues of the protein mitogen. Purified 
PDGF receptor polypeptides may also be obtained by a method 
such as PDGF affinity chromatography using activated CH- 
5 Sepharose coupled to PDGF through primary amino groups as 
described in Imamura et al. (1988) Biochem. Bioohvs. Res. 
Commun. 155:583-590. 

Depending on the availability of specific antibodies, 
specific PDGF receptor peptide constructs may also be purified 

10 using immuno-affinity chromatography. Antibodies prepared, as 
described below, may be immobilized to an. inert substance to 
generate a highly specific immuno-aff inity column. See, e.g., 
Harlow and Lane (1990) Monoclonal Antibodies: — h Laboratory 
Manual . Cold Spring Harbor Laboratory, which is hereby 

15 incorporated herein by reference. 

Various cells or tissues may be selected as starting 
materials, usually selected on the basis of abundant expression 
of the desired receptor construct or polypeptide. High 
expression promoter sequences may be operably linked to a 

20 recombinant sequence, preferably an inducible promoter. The 
promoter is operably linked when it operates to promote the 
sequence. Appropriate cells that contain relatively large 
amounts of the receptor protein, as determined by high affinity 
binding of PDGF, can be transformed with variants of the PDGF 

25 receptor polypeptides. These may be used to replace the 

natural form of PDGF receptor by a construct with a deletion or 
insertion. 

The ligand binding regions (LBR's) or other segments 
may be "swapped" between different new fusion constructs or 

30 fragments. Thus, new chimeric polypeptides exhibiting new 

combinations of segments can result from the structural linkage 
of different functional domains. Ligand binding regions which 
confer desired or modified specificities may be combined with 
other domains which have another function, e.g., each Ig-like 

35 domain could be substituted by a similar domain from other 
related polypeptides, or LBR's between different alleles or 
similar receptors may be combined. 
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The present invention also provides for fusion 
polypeptides between the receptor polypeptide domains and other 
homologous or heterologous proteins. Homologous proteins may 
be fusions between similar but different growth factor 
5 receptors resulting in f e.g., a hybrid protein exhibiting 

ligand specificity of one receptor with an intracellular domain 
of another, or a receptor which may have altered affinity or a 
broadened or narrowed specificity of binding. Likewise, 
heterologous fusions may be constructed which exhibit a 
10 combination of properties or activities of the derivative 
proteins. Typical examples are fusions of a reporter 
polypeptide, e.g., lucif erase, with a domain of a receptor, 
e.g., a ligand binding domain from the extracellular region of 
a human platelet-derived growth factor receptor, so that the 
15 presence or location of a desired ligand may be easily 

determined. See, e.g., Dull et al., U*S. Patent No. 4,859,609, 
which is hereby incorporated herein by reference. Other gene 
fusion partners include bacterial /J-galactosidase, trpE, 
protein A, ^-lactamase, a-amylase, alcohol dehydrogenase, and 
20 yeast cr-mating factor. See, e.g., Godowski et al., (1988) 
Science 241: 812-816. Additional sequences with various 
defined functions may be found by searching through the 
GenBank™ (National Institutes of Health) sequence data bank. A 
heterologous fusion protein is one which includes sequences not 
25 naturally found in conjunction with one another. Thus, a 

heterologous fusion protein may be a fusion of two similar, and 
homologous, sequences. 

Fusion proteins "would typically be made by either 
recombinant nucleic acid methods with expression, or by 
30 synthetic polypeptide methods. Techniques for nucleic acid 

manipulation are described generally, for example, in Sambrook 
et al. (1989) ftp} ecujar Cloning; A LafroyatprY franual (2nd ed.) 
volumes 1-3, Cold Spring Harbor laboratory, which is hereby 
incorporated herein by reference. Techniques for synthesis of 
35 polypeptides are described, for example in Merrifield (1963) 0\ 
Amer. Chem. Soc. 85:2149-2456; Atherton et al. (1989) Solid 
Phase Peptide Synthesis: A Practical Approach, IRL Press, 
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Oxford; and Merrifield (1986) Science 232:341-347? each of 
which is hereby incorporated herein by reference. 

The recombinant nucleic acid sequences used to 
produce fusion proteins of the present invention may be derived 
5 from natural or synthetic sequences. Many natural gene 
sequences are available from various cDNA or from genomic 
libraries using appropriate probes, see, e.g., GenBank™, 
National Institutes of Health". 

Typical probes for isolating platelet-derived growth 

10 factor receptor genes may be selected from sequences of Tables 
1 and 2, in accordance with standard procedures, suitable 
synthetic DNA fragments may be prepared, e.g., by the 
phosphoramidite method described by Beaucage and Carruthers 
(1981) Tetra. Letts. 22:1859-1862. A double stranded fragment 

15 may then be obtained by either synthesizing the complementary 
strand and hybridizing the strands together under appropriate 
conditions or by adding the complementary strand using DNA 
polymerase with an appropriate primer sequence. 

20 III. Nucleic Acids 

The present invention provides nucleic acid sequences 
encoding various PDGF receptor sequences described above. 
Tables 1 and 2, respectively set forth the corresponding cDNA 
sequences encoding human type B and type A PDGF receptor 

25 polypeptides. 

Substantial homology in the nucleic acid context 
means either that the segments , or their complementary strands , 
when compared, are the same when properly aligned, with 
appropriate nucleotide insertions or deletions, in at least 

30 about 60% of the residues, typically at least about 70%, more 
typically at least about 80%, usually at least about 90%, and 
more usually at least about 95 to 98% of the nucleotides. 
Appropriate nucleotide insertions or deletions include 
interdomain sequences, or those external to the cysteines 

35 within a domain, but the sequences within the paired cysteines 
(or their equivalents in the D4 domains) will often be very 
important to retain. Structural homology will exist when there 
is at least about 55% homology over a stretch of at least about 
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14 nucleotides, typically at least about 65%, more typically at 
least about 75%, usually at least about 90%, and more usually 
at least about 95% or more. 

Alternatively, substantial homology exists when the 
5 segments will hybridize under selective hybridization 

conditions, to a strand, or its complement, typically using a 
sequence of at least about 20 contiguous nucleotides derived 
from Table 1 or 2. However, larger segments would usually be 
preferred, e.g., at least about 30 contiguous nucleotides, more 
10 usually at least about 40, and preferably more than about 50. 
Selectivity of hybridization exists when hybridization occurs 
which is more selective than total lack of specificity. See, 
Kanehisa (1984) Nucleic Acids Res. 12:203-213, which is 
incorporated herein by reference. 
15 Stringent hybridization conditions will normally 

include salt concentrations of less than about 1 M, typically 
less than about 700 mM, more typically less than about 500 ibM, 
usually less than about 400 mM, more usually less than about 
300 mM, and preferably less than about 200 mM. Temperature 
20 conditions will typically be greater than about 20 # C, more 

typically greater than about 25 *C, usually greater than about 
30*C, more usually greater than about 37 # C, and preferably in 
excess of about 40 *C, depending upon the particular 
application. As other factors may significantly affect the 
25 stringency of hybridization, including, among others, base 

composition and size of the complementary strands, presence of 
organic solvents, and extent of base mismatching, the 
combination of parameters is more important than the absolute 
measure of any one. 
30 Probes may be prepared based on the sequence of the 

PDGF receptor encoding sequences provided in Tables 1 and 2. 
The probes may be used to isolate other PDGF receptor nucleic 
acid sequences by standard methods. See, e.g., Sambrook et al. 
(1989) Molecular Cloning; A Laboratory Manual , vols. 1-3, CSH 
35 Press, N.Y., which is hereby incorporated herein by reference. 
Other similar nucleic acids may be selected for by using 
homologous nucleic acids • Alternatively, nucleic acids 
encoding these same or similar receptor polypeptides may be 
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synthesized or selected by making use of the redundancy in the 
genetic code. Various codon substitutions may be introduced, 
. e.g. , silent changes thereby providing various convenient 
restriction sites, or to optimize expression for a particular 
5 system, e.g>, to match the optimum codon usage. Mutations may 
be introduced to modify the properties of the receptors, 
perhaps to change the ligand binding affinities, the inter- 
chain affinities r or the polypeptide degradation or turnover 
rate. 

10 The DNA compositions of this invention may be derived 

from genomic DNA or cDNA, prepared by synthesis or may be a 
hybrid of the various combinations. Recombinant nucleic acids 
comprising sequences otherwise not naturally occurring in 
continuity are also provided by this invention. An isolated 

15 DNA sequence includes any sequence that has been obtained by 
primer or hybridization reactions or subjected to treatment 
with restriction enzymes or the like. 

Synthetic oligonucleotides can be formulated by the 
triester method according to Matteucci et al. (1981) J . Am. 

20 Chem. Soc. 103:3185 or by other methods such as commercial 

automated oligonucleotide synthesizers. Oligonucleotides can 
be labeled by excess polynucleotide kinase (e.g., about 10 
units to 0.1 nanomole substrate is used in connection with 50 
[■" mM Tris r pH 7.6, 5 mM dithiothreitol , 10 mM MgCl 2 , 1-2 mM ATP, 

25 1.7 pmoles 32 P-ATP (2.9 mci/mmole) 0.1 mM spermidine ,0.1 mM 

EDTA) • Probes may also be prepared by nick translation, Klenow 
fill-in reaction, or other methods known in the art. See, 
e.g. , Sambrook et al. 

. cDNA or genomic libraries of various types may be 

30 screened for new alleles or related sequences. The choice of 
cDNA libraries normally corresponds to a tissue source which is 
abundant in mRNA for the desired receptors. Phage libraries 
are normally preferred, but plasmid libraries may also be used. 
Clones of a library are spread onto plates, transferred to a 

35 substrate for screening, denatured, and probed for the presence 
of desired sequences. 

For example, with a plaque hybridization procedure, 
each plate containing bacteriophage plaques is replicated onto 
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duplicate nitrocellulose filter papers (Millipore-HATF) . The 
phage DNA is denatured with a buffer such as 500 mM NaOH, 1.5 M 
Nacl for about 1 minute, and neutralized with, e.g., 0.5 M 
Tris-HCl, pH7.5, 1.5 M NaCI (3 times for 10 minutes each). 
The filters are then washed. After drying, the filters are 
typically baked, e.g., for 2 hours at 80 *C in a vacuum oven. 
The duplicate filters are prehybridized at 42 'C for 4-24 hours 
with 10 ml per filter of DNA hybridization buffer (20-50% 
formamide, 5X SSC, pH 7.0, 5X Denhardt's solution 
(polyvinylpyrrolidone, plus Ficoll and bovine serum albumin? 
IX = 0.02% of each), 50 mM sodium phosphate buffer at pH 7.0, 
0.2% SDS, and 50 Mg/ml denatured salmon sperm DNA). 
Hybridization with an appropriate probe may be performed at 
42 *c for 16 hrs with 10 ml/filter of l x 10* cpm/ml of DNA 
hybridization buffer containing radioactively labeled probe. 
The final concentration of formamide is varied according to the 
length of the probe and the degree of stringency desired. See, 
e.g., Wetmur and Davidson (1968) J. Mol. Biol" 31:349-370; and 
M. Kanehisa (1984) Nuc. Acids each of which is 

incorporated herein by reference, for a discussion of 
hybridization conditions and sequence homology. 

An oligonucleotide probe based on the disclosed amino 
acid sequences may be used to site specifically mutate or 
generate recombinant .fusion or deletion constructs. See, e.g., 
Tables u and 12 for preferred oligonucleotide reagents. 
Procedures such as those described by Kimbel et al. (1987) 
Methods in , Sn?yftPloqY £5*1:367 , may be used. The sequences PA1 
through PA9 correspond to Seq. ID No. 6 through 14, 
respectively, and sequences PA101 through PA109 correspond to 
Seq. ID No. 15 through 23, respectively. 
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TABLE IX 

HDJOH B-type PDGF-R MUTAGENESIS OLIGOMERS 

Domain 5 / 3'KonCoding 

5 t CCA CAC TCC TTG CCC TTT AAG / TAGCTTCCTGTAGGGGGCTG 3' . 
PHSLPFK/* ********** 



Domain 4 / 
5« TCC TTC GAC CTA CAG ATC AAT / 
SFQLQIN/ 



3»NonCoding 
TAGCTTCCTGTAGGGGGCTG 3 1 
* ********** 



Domain 3 / 3»NonCoding 

PA3 5' ATC ACC GTG GTT GAG AGC GGC / TAGCTTCCTGTAGGGGGCTG 3' 
ITV VESG/* ********** 

Domain 2 / 3»Noncoding . 

PA4 5' TAC AGA CTC CAG GTG TCA TCC / TAGCTTCCTGTAGGGGGCTG 3 



Y 



RLQVSS/* ********** 



Domain i / 3»NonCoding 

PA5 5» CTC TAC ATC TTT GTG CCA GAT CCC / TAGCTTCCTGTAGGGGGCTG 3 

L YIFVPDP /* ******** 



Signal Sequence : Domain 1 / D 2^ i 2Jl ^ riT 

PA6 5 ' CAG ATC TCT CAG GGC: CTG GTC /ACC GTG GGC TTC CTC CCT AAT CAT 3 
Q 1 S Q G : L V /T V G F L P N D 



PA7 



Sional Seouence : Domain I / Domain 3 ■ 

5> CAG ATC TCT GGC: CTG GTC/ ATC AAC GTC TCT GTG AAC GCA GTG CAG3 

Q I S Q G : L V/I H V S V N A V Q 



Signal. Sequence : Domaxn 1 / ILr era rie ctg - • 

PA8 5» CAG ATC TCT CAG GGC: CTG GTC / TAC GTG CGG CTC CTG GGA GAG CTG - 
q I S Q G:L V/Y V R L L G E V 

Signal Sequence : Domain 1 / Dom ^± 1 l w » *™ 

PA9 5' CAG ATC TCT CAG GGC : CTG GTC / GTC CGA GTG CTG. GAG CTA AGT 3 
QISQG :I*V /VRVL WLA 
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TABLE 12 

PROPOSED HDXAK A-type PDGF-R MUTAGENESIS OLIGOMERS 

Doaain 5 / 3»NonCoding 

PA101 5' GCT CCC ACC CTG CGT TCT GAA / TAACTGGCGGATTCGAGGGG 3» 
APTLRSE/ * ********** 



Doaain 4 / 3»NonCoding 

PA102 5» GAA CTG TTA ACT CAA GTT CCT / TAACTGGCGGATTCGAGGGG 3' 
ELLT QV P / * ********** 



Domain 3 / S^onCoding 

PA103 5* ATT TCT GTC CAT GAG AAA GGT / TAACTGGCGGATTCGAGGGG 
I S V H E K G / * ********** 



Doaain 2 / 3'NonCoding 

PA104 5 1 TAT GCT TTA AAA GCA ACA TCA / TAACTGGCGGATTCGAGGGG 3 » 
YALKATS/* ********** 



Domain 1 ./ 3»NonCoding 

PA105 5 1 ATT TAC ATC TAT GTG CCA GAC CCA / TAACTGGCGGATTCGAGGGG 3 1 
IVI YVPDP/* ********** 



Signal Sequence : Domain 1 / Domain 2 

PA106 5 ' AGC CTA ATC CTC TCC CAG CTT / GAT GTA GCC TTT GTA CCT CTA GGA 3' 
. 5 L I L C : Q t/DVAPV PLG 



Signal Sequence : Domain 1 / Domain 3 

PA107 5 1 AGC CTA ATC CTC TGC CAG CTT/ GAG CTG GAT CTA GAA ATG GAA GCT CTT 3 » 
S Zi I L C : Q L / E L D L E K E - A L 



Signal Sequence : Domain 1 / Domain 4 

PA108 5 1 AGC CTA ATC CTC. TGC CAG CTT / TTC ATT GAA ATC AAA CCC ACC TTC 3' 
S L 1 LC:Q L/F I E I K PT F 



Signal Sequence : Domain 1 / Domain 5 

PA109 5 1 AGC CTA ATC CTC TGC CAG CTT / TCA TCC ATT CTG GAC TTG GTC 3* 
SLILC:QL/SSILD LV 
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In accordance with this invention any isolated DNA 
sequence which encodes substantially a PDGF-R complete 
structural sequence can be used as a probe* Alternatively, any 
DNA sequence that encodes a PDGF-R hydrophobic signal sequence 
5 and its translational start site may be used. An isolated 

partial DNA sequence which substantially encodes intact domains 
exhibiting PDGF-R activity (e.g., ligand or PDGF-R binding) is 
also part of this invention. Preferred probes are cDNA clones 
of PDGF receptor polypeptides. 

10 The DNA sequences used in this invention will usually 

comprise intact domain structures, typically at least about 5 
codons (15 nucleotides) , more typically at least about 9 
codons, usually at least about 13 codons, more usually at least 
about 18 codons, preferably at least about 25 codons and more 

15 prefetably at least about 35 codons. One or more introns may 
also be present. This number of nucleotides is usually about 
the minimal length required for a successful probe that would 
hybridize specifically with a PDGF receptor sequence. For 
example, epitopes characteristic of a PDGF-R may be encoded in 

20 short peptides. Usually the wild-type sequence will be 
employed, in some instances one or more mutations may be 
t introduced, such as deletions, substitutions, insertions, or 
inversions. These modifications may result in changes in the 
amino acid sequence, provide silent mutations, modify a 

25 restriction site, or provide specific mutations. The genomic 
sequence will usually not exceed about 200 kb, more usually not 
exceed about 100 kb, preferably not greater than about 0.5 kb. 

Portions of the DNA sequence having at least about 10 
nucleotides from a DNA sequence encoding an PDGF receptor 

30 peptide will typically be used, more typically at least about 
15 nucleotides, usually at least about 20 nucleotides, more 
usually at least about 25 nucleotides, and preferably at least 
about 30 nucleotides. The probes will typically be less than 
about 6 kb, usually fewer than about 3.0 kb, and preferably 

35 less than about 1 kb. The probes may also be used to determine 
whether mRNA encoding a specific PDGF-R is present in a cell or 
different tissues. 



10 
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The natural or synthetic DNA fragments coding for a 
desired platelet-derived growth factor receptor fragment will 
usually be incorporated into DNA constructs capable of 
introduction to and expression in an in vitro cell culture. 
Often the DNA constructs will be suitable for replication in a 
unicellular host, such as yeast or bacteria, but may also be 
intended for introduction to, with and without integration 
within the genome, cultured mammalian, or plant or other 
eukaryotic cell lines. Human cells may be preferred hosts. 
Higher eukaryote host cells will often be preferred because 
their glycosylat ion and protein processing patterns more likely 
simulate human processing. DNA constructs prepared for 
introduction into bacteria or yeast will typically include a 
replication system recognized by the host, the intended DNA 
15 fragment encoding the desired receptor polypeptide construct, 
transcriptional and translational initiation regulatory 
sequences operably linked to the polypeptide encoding segment, 
and transcriptional and translational termination regulatory 
sequences operably linked to the polypeptide encoding segment. 
20 The transcriptional regulatory sequences will typically include - 
a heterologous enhancer or promoter which is recognized by the 
host. The selection of an appropriate promoter will depend 
upon the host, but promoters such as the trp, lac, and phage 
promoters, tRNA promoters, and glycolytic enzyme promoters are 
25 known and available. See, e.g., Sambrook et al. (1989). 

Conveniently available expression vectors which include the 
replication system and transcriptional and translational 
regulatory sequences together with the insertion site for the 
platelet-derived growth factor receptor DNA sequence may be 
30 employed. Examples of workable combinations of cell lines and 
expression vectors are described, e.g. , in Sambrook et al. 
(1989); see_^ls2, Metzger et al. (1988) Nature 334:31-36. 

Expression vectors for these cells can include 
expression control sequences, such as an origin of replication, 
35 a promoter, an enhancer and necessary processing information 
sites, e.g., ribosome-binding sites, RNA splice sites, 
polyadenylation sites, and transcriptional terminator 
sequences. Preferably, the enhancers or promoters will be 




WO 92/13867 PCT/US92/0O730 

.54 

those naturally associated with genes encoding the PDGF 

receptor polypeptides, although it will be understood that in 
-many cases others will be equally or more appropriate* Other 

preferred expression control sequences are enhancers or 
5 promoters derived from viruses, such as SV40, Adenovirus, 

Bovine Papilloma Virus, and the like. 

Similarly, preferred promoters are those found 

naturally in immunoglobul in-producing cells, see , e.g., U.S. 

Patent No. 4,663,281, which is incorporated herein by 
10 reference, but SV40, polyoma virus, cytomegalovirus (human or 

murine) and the LTR from various retroviruses, e.g., murine 

leukemia virus, murine or Rous sarcoma virus and HIV, may be 

utilized, as well as promoters endogenous to PDGF-R genes. 

See , Enhancers and Eukarvotic Gene Expression , (1983) Cold 
15 Spring Harbor Press, N.Y., which is incorporated herein by 

reference. 

The vectors containing the DNA segments of interest, 
e.g., a PDGF receptor polypeptide gene or cDNA sequence, can be 
transferred into the host cell by well-known methods, which 

20 vary depending on the type of cellular host. For example, 
calcium chloride trans feet ion is commonly utilized for 
prokaryotic cells , whereas calcium phosphate treatment may be 
used for other cellular hosts. See generally, Sambrook et al. 
(1989) Molecular Cloning: A Laboratory Manual (2d ed.) CSH 

25 Press, which is incorporated herein by reference. The term 
"transformed cell" is meant to also include the progeny of a 
transformed cell. 

As with the purified polypeptides, the nucleic acid 
segments associated with the ligand-binding segment, the 

30 extracellular domain, and the intracellular domain are 

particularly useful. These gene segments will be used as 
. probes for screening for new genes exhibiting similar 
biological activities, though the controlling elements of these 
genes may also be of importance. 

35 
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IV. Methods for Making PDGF Receptor Polypeptide 
Constructs 

DNA sequences may also be used to express PDGF-R 
polypeptides* For example, a DNA sequence of from about 21 
5 nucleotides (encoding about 7 amino acids) to about 2.1 kb- 
(about 700 amino acids) may be used to express a polypeptide 
having a PDGF receptor specific activity, typically 
ligand-binding. In particular, constructs retaining the ligand 
binding regions will be useful, as these constructs will 

10 possess binding activity. 

In particular, various synthetic linkers and probes 
nay be constructed to facilitate genetic engineering of the 
PDGF-R nucleic acid sequences. Polymerase chain reaction (PCR) 
techniques can be applied to producing large quantities of 

15 fragments or segments useful in the proper manipulation of the 
sequences encoding the constructs. See, e.g., Innis et al. 
{1990 > E£B Protocols, Academic Press. Alternatively, nucleic 
acid synthesizers can produce sufficiently large quantities of 
fragments for hybridizing to any preselected sequence, e.g., 

20 from Table 1 or 2, or for manipulating the sequence to add or 
delete specific domains or segments. Particularly important 
segments will be the LBR f s. 

Large quantities of the receptor proteins may be 
prepared by expressing the whole receptor or parts of the 

25 receptor -contained in the expression vehicles in compatible 

hosts such as E^.coii, -yeast, mammalian cells, insect cells, or 
frog oocytes. The expression vehicles may be introduced into 
the cells using methods, well known in the art such as calcium 
phosphate precipitation (discussed below) , lipofectin 

30 electroporation, or DEAE dextran transformation. 

Usually the mammalian cell hosts will be immortalized 
cell lines. To study the characteristics of a PDGF-R and its 
corresponding ligand, it will be useful to transfect, or 
transform mammalian cells which lack or have low levels of a 

35 PDGF receptor. Preferably, a signal sequence can serve to 
direct the peptide to the cell membrane or for secretion. 
Cells lacking significant amounts of PDGF receptors include 
Chinese hamster ovary (CHO) cells, most epithelial cell lines, 
and various human tumor cell lines. 
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Transformed or transfected cells can be selected 
which incorporate a DNA sequence which encodes a receptor that 
is functionally equivalent to a wild-type receptor thereby 
conferring a PDGF-sensitive mitogenic response. Such cells 
5 will enable the analysis of the binding properties of various 
added PDGF receptor polypeptides. Transfected cells may also 
be used to evaluate the effectiveness of a composition or drug 
as a PDGF antagonist or agonist. The level of receptor 
tyrosine kinase activity or the rate of nucleic acid synthesis 

10 can be determined by contacting transfected cells with drugs or 
ligands and comparing the effects of various ligand analogues 
against the controls • Although the most common procaryote 
cells used as hosts are strains of E. coli , other prokaryotes 
such as Bacillus subtilis or Pseudomonas may also be used. The 

15 DNA sequences of the present invention, including fragments or 
portions of the sequence encoding for receptor polypeptides 
comprising intact structural domains, a portion of the 
receptor, or a polypeptide having an PDGF-R activity, can be 
used to prepare an expression vehicle or construct for a PDGF-R 
. 20 polypeptide or polypeptide having a PDGF-R activity. Usually 
the control sequence will be a eukaryotic promoter for 
expression in a mammalian cell. In some vehicles the 
receptor's own control sequences may also be used. A common 
prokaryotic. plasmid vector for transforming E. coli is pBR322 

25 or its derivatives, e.g. the plasmid pkt279 (Clontech) , see 

Bolavar et al. (1977) Gene , 2:95. The prokaryotic vectors may 
also contain prokaryotic promoters for transcription 
initiation, optionally with an operator. Examples of most 
commonly used prokaryotic promoters include the beta-lactamase 

30 (penicillinase) ; lactose (lac) promoter, see Cheng et al. 
(1977) Nature , 198:1056; tryptophan promoter (trp) , see 
Goeddell et al. (1980) Nucleic Acid Res. , 8: 457); P L promoter; 
and the N-gene ribosome binding site, see Shimatake et al. 
(1981) Nature , 292:128-; each of which is hereby incorporated 

35 herein by reference. 

Promoters used in conjunction with yeast can be 
promoters derived from the enolase gene, see Holland et al. 
(1981) .t. Biol. Chem. , 256:1385 ; or the promoter for the 
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synthesis of glycolytic enzymes such as 3-phosphoglycerate 
kinase, see Hitzeman et al. (1980) J. Biol. Chem. . 255:. 

Appropriate non-native mammalian promoters will 
include the early and late promoters from SV40, see Fiers et 
5 al. (1978) Najjuxe, 273:113; or promoters derived from murine * 
muloney leukemia virus, mouse mammary tumor virus, avian 
sarcoma viruses, adenovirus II, bovine papilloma virus, or 
polyoma. In addition, the construct may be joined to an 
amplifiable gene, e.g. dihydrofolate reductase (DHPR) so that 

10 multiple copies of the PDGP receptor gene may be made. See, 
e.g., Kaufman et al. (1985) Mol. and Pell. Biol. 5:1750-1759; 
and Levinson et al. EPO publication nos. 0117059 and 0117060, 
each of which is incorporated hereby by reference. 

Prokaryotes may be transformed by various methods, 

15 including using CaCl 2 , see Cohen (1972) Proc. Nat'l Acad. fic< . 
BSh, 69:2110; or the Rbcl method, see Maniatis et al. (1982) 
Molecular Cloning: A Labnr^nry Manual r cold Spring Harbor 
Press. Yeast may be transformed, e.g., using a method 
described by Van Solingen et al. (1977) J. Bacterid . 130:946; 

20 or Hsiao et al. (1979) Proc. Nat'l Acad. SrH . USA 76:3829. 

With respect to eukaryotes, mammalian cells may be transfected 
using a calcium phosphate precipitation method, see, e.g., 
Graham and van der Eb (1978) viroloov . 52:546; or by lipofectin 
(BRL) or retroviral infection, see, e.g., Gilboa (1983) 

25 frtPerimental Manipulation of ft ?n e Express-inn , chap. 9, Academic 
Press P. 175. The actual expression vectors containing 
appropriate sequences may be prepared according to standard 
techniques involving ligation and restriction enzymes. See 
e.g., Maniatis supra. Commercially available restriction 

30 enzymes for cleaving specific sites of DNA may be obtained from 
New England BioLabs, Beverly, Massachusetts. 

Particular cotransformations with other genes may be 
particularly useful. For example, it may be desired to co- 
express the nucleic acid with another processing enzyme. Such 

35 enzymes include signal peptidase, tertiary conformation 

conferring enzymes, or glycosylating enzymes. This expression 
method may provide processing functions which otherwise might 
be lacking in the expression host, e.g., mammalian-like 
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glycosylation in a prokaryote expression system. 
Alternatively > the host cell selected for expression may be 
chosen on the basis of the natural expression of those 
processing enzymes. 
5 Cell clones are selected by using markers depending 

on the mode of the vector construction. The marker may be on 
the same or a different DNA molecule preferably the same DNA 
molecule. With mammalian cells the receptor gene itself may be 
the best marker. In prokaryotic hosts the trans formant may be 

10 selected by resistance to ampicillin, tetracycline, or other 
antibiotics. Production of a particular product based on 
temperature sensitivity or compensation may serve as 
appropriate markers. Various methods may be used to harvest 
and purify the PDGF-R receptor protein or peptide fragment. 

15 The peptide may be isolated from a lysate of the host. The 
peptide may be isolated from the cell supernatant if the 
peptide is secreted. The PDGF-R peptide is then further 
purified as discussed above using HPLC, electrophoresis , or 
affinity chromatography, e.g., immuno-aff inity or ligand 

20 affinity. 

Another method which can be used to isolate cDNA 
clones of PDGF-R related species involves the use of the 
polymerase chain reaction (PCR) . See, e.g. r Saiki et al. 
(1985) Science 230:1350. In this approach two oligonucleotides 

25 corresponding to distinct regions of the PDGF-R sequence are 
synthesized and then used in the PCR reaction, typically to 
amplify receptor-related mRNA transcripts from an mRNA source. 
Annealing of the oligonucleotides and PCR reactions are 
performed under conditions of reduced stringency. The 

3 0 resulting amplified fragments are subcloned, and the resulting 
recombinant colonies are probed with 32 P-labeled full-length 
PDGF-R cDNA. Clones which hybridize under, low but not high 
stringency conditions represent PDGF-R related mRNA 
transcripts. This approach can also be used to isolate variant 

35 PDGF-R cDNA species which arise as a. result of alternative 

splicing, see Frohman et al. (1988) Proc. Nat'l Acad. Sci. USA, 
85:8998. 
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V. Antibodies 

Polyclonal and/or monoclonal antibodies to the 
various PDGF receptor constructs, receptor peptides, and 
peptide fragments may also be prepared . Peptide fragments may 
5 be prepared synthetically in a peptide synthesizer and coupled 
to a carrier molecule (i.e., keyhole limpet hemocyanin) and 
injected into rabbits over several months. The rabbit sera is 
tested for immunoreactivity to the PDGF receptor protein or 
fragment. Monoclonal antibodies may be made by injecting mice 
10 with PDGF-R protein, PDGF-R polypeptides, or mouse cells 

expressing high levels of the cloned PDGF receptor on its cell 
surface. Monoclonal antibodies will be screened by ELISA and 
tested for specific immunoreactivity with the PDGF receptor 
protein or polypeptides thereof . See, Harlow and Lane (1988) 
15 Antibodies; A Laboratory Manual ,- -CSHarbor Press, which is 

hereby incorporated herein by reference. These antibodies will 
be useful in assays as well as pharmaceuticals. 

Once a sufficient quantity of the desired PDGF 
receptor polypeptide construct has been obtained, the protein 
20 may be used for various purposes. A typical use is the 
production of antibodies specific for binding to epitopes 
characteristic of these receptors. These antibodies may be 
either polyclonal or monoclonal and may be produced by in, vitro 
or in vivo techniques. 
25 For production of polyclonal antibodies, an 

appropriate target immune system is selected, typically a mouse 
or rabbit. The substantially purified antigen is presented to 
the immune system in a fashion determined by methods 
appropriate for the animal and other parameters well known to 
30 immunologists. Typical sites for injection are in the 

footpads, intramuscularly, intraperitoneally, or intradermal ly. 
Of course, another species may be substituted for a mouse or 
rabbit, typically a mammal, but possibly a bird or other 
animal. 

35 An immunological response is usually assayed with an 

immunoassay. Normally such immunoassays involve some 
purification of a source of antigen, for example, produced by 
the same cells and in the same fashion as the antigen was 
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produced. The immunoassay may be a radioimmunoassay , an 
enzyme-linked assay (ELISA) , a fluorescent assay, or any of 
many other choices, most of which are functionally equivalent 
but may exhibit particular advantages under specific 
5 conditions. 

Monoclonal antibodies with affinities of at least 
about 10 6 IT 1 preferably 10 s ' 10 1Q , or higher will be made by 
standard procedures as described, e.g. , in Harlow and Lane, 
(1988) Antibodies: A Laboratory Manual, CSH Press; or Coding, 

10 (1986) tfopoclonal Antibodies: Principles and Practice (2d ed) 
Academic Press, New York, which are hereby incorporated herein 
by reference. Briefly, appropriate animals will be selected 
and the desired immunization protocol followed. After the 
appropriate period of time, the spleens of such animals are 

15 excised and individual spleen cells fused, typically, to 
immortalized myeloma cells under appropriate selection 
conditions. Thereafter the cells are clonally separated and 
the supematants of each clone are tested for their production 
of an appropriate antibody specific for the desired region of 

20 the antigen. 

Other suitable techniques involve in vitro exposure 
of lymphocytes to the antigenic polypeptides or alternatively 
to selection of libraries of antibodies in phage or similar 
vectors. See . Huse et al. "Generation of a Large Combinatorial 

25 Library of the Immunoglobulin Repertoire in Phage Lambda," 
Science 246:1275-1281 (1989), hereby incorporated herein by 
reference. The polypeptides and antibodies of the present 
invention may be used with or without modification. 
Frequently, the polypeptides and antibodies will be labeled by 

30 joining, either covalently or non-covalently, a substance which 
provides for a detectable signal. A wide variety of labels and 
conjugation techniques are known and are reported extensively 
in both the scientific and patent literature. Suitable labels 
include radionuclides, enzymes, substrates, cof actors, 

35 inhibitors, fluorescens, chemiluminescers, magnetic particles 
and the like. Patents, teaching the use of such labels include 
U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 
4,277,437? 4,275,149; and 4,366,241. Also, recombinant 
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immunoglobulins may be produced , see Cabilly, U.S. Patent 
No. 4,816,567, 

Antibodies of particular interest are those raised 
against the ligand binding regions. These will include some 
5 antibodies which function as ligands. Or, antibodies may be . 
used to select for compounds which could serve as ligands for 
modified receptors. See, e.g., Meyer (1990) Nature 347:424- 
425; and Pain et al. (1990) Nature 347:444-447; each of which 
is hereby incorporated herein by reference. 

10 

VIII. Methods for Use 

The present invention provides platelet-derived 
growth factor receptor (PDGF-R) polypeptide purification 
methods as. well as methods for synthesizing PDGF receptors 

15 within cells. Also provided are homogeneous receptors produced 
by these methods, nucleic acid sequences encoding the receptors 
or portions of the receptors, as well as expression vehicles 
containing these sequences, cells comprising the PDGF- 
receptors, and antibodies to the' receptors. In particular, the 

20 present invention provides methods for assaying binding and 
other activities of receptor-like proteins having rearranged 
combinations of the domains. 

The extracellular region of the human type B PDGF 
receptor protein has been used to successfully bind PDGF BB 

25 ligand in a receptor activation assay. PDGF BB ligand binding 
to NIH3T3 cell-associated PDGF receptors is measured. Ligand 
binding causes phosphorylation (activation) of the cell 
associated receptors. Receptor phosphorylation is followed in 
a multi-step process which first involves solubilization of 

30 NIH3T3 cells and separation of cell proteins by electrophoresis 
of cell extracts on sodium dodecyl sulfate polyacrylamide gels. 
Gels are blotted onto nitrocellulose and treated with anti- 
phosphotyrosine monoclonal antibodies to aid in the detection 
of phosphorylated PDGF receptor. Monoclonal antibodies are 

35 visualized through autoradiography of antibody-associated 125-1 
protein A which has been introduced at the terminal stage of 
the assay. 



\ 
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If human type B receptor protein (at about a 60 fold 
molar excess to PDGF BB ligand) is preincubated with ligand for 
1 hour prior to incubation with NIH3T3 cells, there is no cell- 
associated PDGF receptor phosphorylation. This indicates that 
5 the human type B PDGF receptor protein binds PDGF BB ligand in 
solution and prevents the ligand from activating cell- 
associated PDGF receptors. Thus, polypeptides which contain 
LBR's may be used to block normal PDGF responses. 

The domain containing structures of the present 
10 invention will find use both as diagnostic and therapeutic 
reagents. The receptor polypeptides may be used as affinity 
reagents for detecting or binding ligand, as well as for 
interacting with receptor-like proteins, e.g., affecting 
receptor protein dimerization. The polypeptides will also be 
15 useful as reagents for detecting or purifying other proteins 
which associate with the receptors or fragments thereof. 

The receptor polypeptides will also find use in 
generating other reagents, e.g., antibodies specific for 
binding epitopes peculiar to the modified receptors. In 
20 particular, antibodies raised against newly formed ligand 
binding determining segments may serve as ligands for the 
modified receptors. These techniques may provide for 
separating various functionalities of the receptors, thereby 
isolating each of the different effector functions from others, 
25 in response to PDGF binding. 

The modified receptors of the present invention also 
provide methods for assaying ligands for them. For example, 
soluble ligand binding fragments will be useful as competing 
sites for ligand binding, a useful property in a ligand binding 
30 assay. In particular, the present invention provides an assay 
to screen for PDGF binding inhibition, allowing screening of 
large numbers of compounds. These compounds may be assayed in 
vitro, which allows testing of cytotoxic or membrane disruptive 
compounds. The present solid phase system allows reproducible, 
35 sensitive, specific, and readily automated assay procedures. 
Polystyrene 96-well plates may be coated with the appropriate 
construct with LBR's to assay for ligand binding activity. 
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Moreover, modifications to the ligand binding domains 
will lead to binding region combinations with different ligand 
binding affinities. Thus, modulation of ligand effected 
response may be easily achieved by inclusion of the appropriate 
5 affinity modified analogue. 

Solid phase assays using these modified receptors may 
also be developed, providing greater sensitivity or improved 
capacity over unmodified binding regions . 

Diagnostic kits comprising these reagents are also 

10 provided. The kit typically comprise a compartmentalized 

enclosure, e.g., a plastic substrate having diagnostic reagents 
of the invention attached thereto* The package will typically 
also include various buffers, labeling reagents, and other 
reagents as appropriate for the diagnostic test to be 

15 performed. Instructions for use of the related reagents and 
interpretation of the results will be provided. 

In particular, the important functional segment of 
the extracellular domain will usually be attached to a plastic 
or other solid phase substrate. The binding regions will 

20 usually be selected for a combination of the affinity and 
ligand binding spectrum of the modified binding segments. 
Appropriate ligands will often be introduced to determine the 
ligand binding, activity and affinity. Different LBR 
combinations will be used, and can be used to test for 

25 differently modified, e.g., labeled, ligands. 

-In addition, the peptides will.be useful for 
therapeutic administration. The quantities of "reagents 
necessary for effective therapy will depend upon many different 
factors, including means of administration, target site, 

30 physiological state of the patient, and other medicants 

administered. Thus, treatment dosages should be titrated to 
optimize safety and efficacy* Typically, dosages used in vitro 
may provide useful guidance in the amounts useful for in situ 
administration of these reagents.. Animal testing of effective 

35 doses for treatment of particular disorders will provide 
further predictive indication of human dosage. Various 
considerations are described, e.g., in Gilman et al. (eds) , 
(1990) Goodman and Gilman's: The Pharmacological Basis of 
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Thera peutics , 8th ed. , Pergamon Press? and Reminqton's 
^.^^ntical sciences , (1985) 7th ed. , Mack Publishing Co. , 
Easton, Penn. ; each of which is hereby incorporated by 
reference. Methods for administration are discussed therein, 
5 e.g., for oral, intravenous, intraperitoneal, or intramuscular 
administration, transdermal diffusion, and others. 
Pharmaceutical^ acceptable carriers will include water, 
saline, buffers, and other compounds described, e.g., in the 
Merck Index, Merck & Co., Rahway, New Jersey. Because of the 
10 high affinity binding between PDGF and its receptors, low 
dosages of these reagents would be initially expected to be 
effective. Thus, dosage ranges would ordinarily be expected to 
be in amounts lower than 1 mM concentrations, typically less 
than about 10 jiM concentrations, usually less than about 
15 100 nM, preferably less than about 10 pM (picomolar) , and most 
preferably less than about 1 fM (femtomolar) , with an 
appropriate carrier. 

The pharmaceutical compositions will be administered 
by parenteral, topical, oral or local administration, such as 
20 by aerosol or transdermal ly, for prophylactic and/or 

therapeutic treatment. The pharmaceutical compositions can be 
administered in a variety of unit dosage forms depending upon 
•the method of administration. For example, unit dosage forms 
suitable for oral administration include powder, tablets, 
25 pills, capsules and dragees. 

Preferably, the pharmaceutical compositions are 
administered intravenously. Thus, this invention provides 
compositions for intravenous administration which comprise a 
solution of the compound dissolved or suspended in an 
30 acceptable carrier, preferably an aqueous carrier. A variety 
of aqueous carriers may be used, e.g., water, buffered water, 
0.4% saline, and the like. These compositions may be 
sterilized by conventional, well known sterilization 
techniques, or may be sterile filtered. The resulting aqueous 
35 solutions may be packaged for use as is, or lyophilized, the 
lyophilized preparation being combined with a sterile aqueous 
solution prior to administration. The compositions may contain 
. pharmaceutically acceptable auxiliary substances as required to 
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approximate physiological conditions , such as pH adjusting and 
buffering agents, tonicity adjusting agents , wetting agents and 
the like, for example, sodium acetate, sodium lactate, sodium 
chloride, potassium chloride, calcium chloride, sorbitan 
5 moholaurate, triethanolamine oleate, etc. 

For solid compositions, conventional nontoxic solid 
carriers may be used which include, for example, pharmaceutical 
grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharin, talcum, cellulose, glucose, sucrose, magnesium 

10 carbonate, and the like. For oral administration, a 

pharmaceutical ly acceptable nontoxic composition is formed by 
incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of 
active ingredient, preferably about 20% (see, Remington's . 

15 s^pra) . 

For aerosol administration, the compounds are 
preferably supplied in finely divided form along with a 
surfactant and propellant. The surfactant must, of course,, be 
nontoxic, and preferably soluble in the propellant. 

20 Representative of such agents are the esters or partial esters 
of fatty acids containing from 6 to 22 carbon atoms, such as 
caproic, octanoic, lauric, palmitic, stearic, linoleic, 
linolenic, olesteric and oleic acids with an aliphatic 
polyhydric alcohol or its cyclic anhydride such as, for 

25 example, ethylene glycol, glycerol, erythritol, arabitol, 
mannitol, sorbitol, the hexitol anhydrides derived from 
sorbitol, and the poly oxy ethylene and polyoxypropylene 
derivatives of these esters. Mixed esters, such as mixed or 
natural glycerides may be employed. The surfactant may 

30 constitute 0.1%-20% by weight of the composition, preferably 
0.25-5%. The balance of the composition is ordinarily 
propellant. Liquefied propellants are typically gases at 
ambient conditions, and are condensed under pressure. Among 
suitable liquefied propellants are the lower alkanes containing 

35 up to 5 carbons, such as butane and propane; and preferably 
fluorinated or fluorochlorinated alkanes. Mixtures of the 
above may also be employed. In producing the aerosol, a 
container equipped with a suitable valve is filled with the 
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appropriate propellant, containing the finely divided compounds 
and surfactant. The ingredients are thus maintained at an 
elevated pressure until released by action of the valve. 

The compositions containing the compounds can be 
5 administered for prophylactic and/ or therapeutic treatments. 

In therapeutic applications , compositions are administered to a 
patient already suffering from a disease, as described above, 
in an amount sufficient to cure or at least partially arrest 
the symptoms of the disease and its complications. An amount 
10 adequate to accomplish this is defined as "therapeutically 

effective dose." Amounts effective for this use will depend on 
the severity of the disease and the weight and general state of 
the patient. 

In prophylactic applications, compositions containing 
15 the compounds of the invention are administered to a patient 

susceptible to or otherwise at risk of a particular disease. 

Such an amount is defined to be a "prophylactically effective 

dose. * In this use, the precise amounts again depend on the 

patient's state of health and weight. 
20 The invention will better be understood by reference 

to the following illustrative examples. The following examples 

are of f ered by way of illustration and not by way of 

limitation. 

25 EXPERIMENTAL 

In general, standard techniques of recombinant DNA 
technology are described in various publications, e.g., 
Sambrook et al. (1989) Molecular cloning : A Laboratory Manual, 
Cold Spring Harbor Laboratory; Ausubel et al. (1987) Current 

30 Protocols in Molecular Biology , vols. 1 and 2 and supplements; 
and Wu and Grossman (eds.) (1987) Methods in Enzymology, Vol. 
53 (Recombinant DNA Part D) ; each of which is incorporated 
herein by reference. 

35 - I. Human Extracellular Region 

Equivalent techniques for construction, expression, 
and determination of the physiological effect of truncation or 
deletion analogues of the soluble extracellular receptor 
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fragments from the human receptor may be performed using the 
nucleic acid r polypeptide, and other reagents provided herein. 

A. Type B Segments 

Constructs of type B receptor polypeptides were made 
as follows: 

The 3.9 kb EcoRI-Hind III cDNA fragment of the human 
type B hPDGF-R was subcloned into the EcoRI-Hind III site of 
M13 Mpl8 to produce a vector Mpl8PR. For techniques, see 
Maniatis et al. (1982) Molecular Cloning; A Laboratory Manual, 
Cold Spring Harbor, N.Y., which is incorporated herein by 
reference. Verification of subcloning was performed by 
restriction enzyme digestion analysis and dideoxy chain 
termination sequencing , as described by Sanger et al. (1977) 
Proc . Nat ' 1 Acad . Sci . USA 74:5463. Oligonucleotide directed 
in vitro mutagenesis was performed according to the method 
described by Kunkel et al. (1987) Methods in Enzvmol . . 154:367. 
The strategy for oligonucleotide directed in vitro deletion 
mutagenesis of Mpl8PR is outlined in Fig. 1. 

In brief, a series of oligonucleotides were designed 
to create a nested set of soluble type B hPDGF receptor 
extracellular regions by deletion mutagenesis. These domains 
are designated Domain 1 through Domain 5 (D1-D5) , suitable for 
expression in an appropriate eukaryotic expression system. A 
description of the mutagenic oligonucleotides aligned with the 
corresponding -regions of the human PDGF receptor are listed in 
Table 10. The resulting constructs are labeled as indicated in 
Table 13. The antisense strand was used for mutagenesis 
throughout. Mutagenesis of PA1, PA2, PA3, PA4, and PAS , 
utilized Mpl8PR as the template and mutagenesis of PA6, PA7 , 
PA8 , and PA9 , utilized MP 18 PA1 as the template. PA1, a 41 bp 
oligomer, introduced a TAG stop codon after Lysine 499 (K 499 ) of 
D5 and removed the transmembrane (TM) as well as entire 
intracellular kinase domain (K) , producing an Mpl8 PA1 (see 
Fig;. 1). PA1 codes for 530 148 precursor proteins. 
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TABLE 13 

HUMAN TYPE B PDGF-R EXPRESSION CONSTRUCTS 



Soluble Membrane Bound 

pBJPR 



10 pBJPAl 
pBJPA2 
pBJPA3 
pBJPA4 
pBJPAS 

15 pBJPA6 
pBJPA7 
PBJPA8 
pBJPA9 



20 



The human PDGF receptor constructs were subsequently 
25 subcloned into the EcoRI-Hind III site of pBJl a derivation of 
pCDL-SRa296, as described in Takabe et al. (1988) Molec. Cell, 
Biol. 8:466, and co-trans fected with pSV2NE0 , as described by 
Southern and Berg (1982) J. Mol. Apol. Gen. . 1: 327, into 
Chinese hamster ovary cells (CHO) . See Pigs. 2 and 3. 
30 Function of the constructs was . demonstrated, as 

follows: 

A sample of 0.33 nM PDGF BB ligand is preincubated 
for 1 hr at 4*C under the following conditions: 

1. a polyclonal antibody to human PDGF (this antibody 
35 recognizes human PDGF AA, PDGF BB and PDGF AB) ; 

2. 18 nM (60 fold molar excess to PDGF BB) human type B 
PDGF receptor; 

3. phosphate buffered saline solution that the receptor 
and antibody are in; or 

40 4. no additions but the ligand itself. 

In a duplicate set of experiments, 0.33 nM PDGF AA is 
incubated with three of the above preincubation conditions , 
e.g., 2, 3, and 4 above. The human type B PDGF receptor does 
not appreciably recognize PDGF AA but this ligand will still 

45 activate cell-associated human type A PDGF receptor from NIH3T3 
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cells and so is a control for human type B PDGF receptor 
specificity and PDGF BB-dependent activation versus non- 
specific general cellular effect, e.g., cytotoxicity. 

The preincubated materials were in a final volume of 
5 0.5 ml. They were placed in one well each of a six well tissue 
culture dish containing a confluent layer of serum starved 
(quiescent) NIH3T3 cells which were chilled to 4'C. The cells 
and incubation mixtures were agitated, e.g., rocked, at 4*C for 
2 h. They were then washed twice with 4*C phosphate buffered 

10 saline. Forty pi of 125 mM Tris(hydroxymethyl) amino methane 
(Tris), pH 6.8, 20% (v/v) glycerol, 2% (w/v) sodium dodecyl 
sulfate (SDS), 2% (v/v) 2-mercaptoethanol , and 0.001% 
bromphenol blue, (known as SDS sample buffer) , was added per 
microtiter well followed by 40 pi of 100 mM Tris, pH 8.0, 30 mM 

15 sodium pyrosphoshate, 50 mM sodium fluoride, 5 mM 
ethylenediaminetetraacetic acid (EDTA) , 5 mM 

ethylenebis(oxyethylenenitrilio)tetraacetic acid, 1% (w/v) SDS, 

100 mM dithiothreitol, 2 mM phenylmethylsulf onylfluoride 

(PMSF) , and 200 /iM sodium vanadate was added to the cells. The 

20 cells were solubilized and 40 /xl additional SDS sample buffer 
was added to the solubilizate. This material was boiled 5 
minutes and loaded onto a single gel sample well of a 7.5% 
sodium dodecyl sulfate polyacrylamide gel. Cellular proteins 
were, separated by electrophoresis. - 

25 The separated proteins were transferred to 

nitrocellulose by electrotransfer and the resulting "Western 
blot" was incubated with 3 changes of 0.5% (w/v) sodium 
chloride, 5 mg/ml bovine serum albumin, 50 mM Tris, pH 7.5, 
(designated blocking buffer) for 20 minutes each at room 

30 temperature. A 1/1000 dilution of PY20 (a commercially 

available monoclonal antibody to phosphotyrosine [ICN]) in 
blocking buffer was incubated with the blot overnight at 4'C. 
The blot was washed 3 times for 20 minutes each at room 
temperature in blocking buffer. The blot was incubated with 4 

35 /iCi/40 ml of 12S I-Protein A [Amersham] in blocking buffer for 1 
hour at room temperature and washed 3 times for 20 minutes each 
at room temperature in blocking buffer. The blot was exposed 
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to X-ray film for 48 h with one intensifying screen at -70 - c 
and developed with standard reagents. 

Figure 4 shows the results of the autoradiogram with 
the conditions mentioned above plus the additional condition of 
5 no added ligand (no PDGF) . This added condition defines the 
level of cell-associated receptor activation (e.g», 
phosphorylation) in the absence of any added ligand. Both the 
antibody and the human type B PDGF receptor neutralized the 
activation of cell-associated PDGF receptor by PDGF BB. This 
10 is apparently due to direct binding and sequestration of the 
ligand making it unavailable for PDGF receptor activation. 
p!85 shows the receptor position. 

i 

B. Type A Sequence 

15 Similar manipulations using the mutagenic 

oligonucleotides of Table 12 are used to construct the type A 
constructs listed in Table 15. Note that the type A constructs 
have not actually been produced, but would readily be produced 
by these methods. Similar assays are used to test the function 

20 of the constructs. 



TABLE 15 

SUGGESTED HUMAN TYPE A PDGF-R EXPRESSION CONSTRUCTS 

type A 

Soluble Membrane pound; 

pARSR 

pARSAl 
pARSA2 
pARSA3 
pARSA4 
pARSA5 
pARSA6 
pARSA7 
pARSA8 
pARSAS 



30 



35 



45 
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C. PDGF Plate Assay 

Polystyrene microtiter plates (Inunulon f Dynatech 
Laboratories) were coated with the extracellular region 
fragment of the type B human PDGF receptor (described above) by 
5 incubating approximately 10-100 ng at this protein per well in 
100 /il of 25 mM Tris r 75 mM NaCl, pH 7.75 for 12 to 18 h at 
4*C. The protein was expressed in transfected CHO cells and 
collected in serum-free media (Gibco MEMa) at a concentration 
of 0.2 - 1 nq/ml, with a total protein concentration of 
10 150 - 300 

The human PDGF type B receptor extracellular region 
fragment was concentrated and partially- purified by passing the 
media over wheat germ-agglutinin-sepharose at 4*C (at 48 ml/h) 
in the presence of 1 mM PMSF. After extensive washing, the 

15 protein was eluted in 0*3 M N-acetyl-glucosamine, 25 mM Hepes, 
100 mM NaCl, 1 mM PMSF, pH 7.4. This fraction was then applied 
to Sephacryl S-200 HR (Pharmacia) equilibrated in 0,15 M 
ammonium bicarbonate pH 7.9. The fractions containing receptor 
(3 s- 10 ng//xl) were detected by SDS-PAGE and Western blotting 

20 with a polyclonal rabbit antibody, made by standard methods, 
against a Domain 1 (Dl) segment from the receptor external 
region. These fractions (3-10 ng//il) were used to coat the 
microtiter wells as described above. The wells were then 
drained, rinsed once with 200 /xl each of 0.5% gelatin (Bio-Rad, 

25 EIA grade), 25 mM Hepes, 100 mM NaCl, pH 7.4, and incubated for 
1-2 h at 24 *C with 150 /xl of this same solution. The wells 
were drained and rinsed twice with 0.3% gelatin, 25 mM Hepes, 
100 mM NaCl, pH 7.4 (150 /xl each). 90 pi of the 0,3% gelatin 
solution was put in each well (wells used to test nonspecific 

30 binding received just 80 pi and then 10 fil of 0.01 mg/ml non- 
labeled PDGF in the 0,3% gelatin solution). PDGF BB (Amgen) 
was iodinated at 4*C to 52,000 CPM/ng with di-iodo Bolton- 
Hunter reagent (Amersham) and approximately 40,000 CPM was 
added per well in 10 /*1, containing 0.024% BSA, 0.4% gelatin, 

35 20 mM Hepes, 80 mM NaCl, 70 mM acetic acid, pH 7.4. The plate 
was incubated for 2-3 h at 24 *C, after which wells were washed 
three times _with 150 fil each with 0.3% gelatin, 25 mM Hepes, 
100 mM NaCl, pH 7.4. The bound radioactivity remaining was 
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solubilized from the wells in 200 Hi 1% SDS, 0.5% BSA, and 
counted in a gamma-counter. The nonspecific binding was 
determined in the presence of a 150-fold excess of ^ abe ^ 
PDGF BB (Amgen) and was about 7% of the total bound I-PDGF. 
5 similar assays will be possible using type A receptor 

fragments. However, the type A receptor fragments are more 
sensitive to the presence of other proteins than the type B 
fragments, and appear to require a different well coating 
reagent from the gelatin. Hemoglobin is substituted for 
10 gelatin in the buffers at about the same concentrations, other 
blocking proteins will be useful selected from, e.g., the Sigma 
Chemical Company. Titrations to optimize the protein type and 
concentration will be performed to find proteins which do not 
affect the receptor protein binding. 
15 The present assays require less than 5 ng/well of 

receptor soluble form, which was expressed in transfected CHO 
cells, and partially purified by affinity and gel 
chromatography. Using iodinated PDGF-BB, the specific binding 
of less than 10 pg of ligand can be detected in an assay volume 
20 of 100 ng/well. At 4'C, the binding of l2S I-PDGF BB to 

immobilized receptor is saturable and of high affinity. The Kd 
by Scatchard analysis was about 1 nM with 1.8 x 10 10 sites per 
well. The nonspecific binding, determined in the presence of a 
100-fold excess of cold PDGF BB, was usually only about 5-10% 
25 of the total binding. The binding was also specific for the 
isoform of the ligand, insofar as excess cold PDGF AA did not 
inhibit ^I-PDGF BB binding. Furthermore, the external region 
of the type B PDGF receptor in solution competes with its 
immobilized form for binding iodinated PDGF BB (IC 50 = 5nM) . 
30 The ^I-PDGF BB bound after 4 h at 4'C is only slowly 

dissociable in binding buffer (t 1/2 > 6 h) , but is completely 
displaced by the addition of a 150-fold excess of unlabeled 

PDGF BB (t 1/2 < 1 h) . " 

These studies were made possible by the availability 
35 of growth factor preparations devoid of contamination with 
other growth factors and by the use of a receptor expression 
system in which all of the measured PDGF responses could be 
attributed to this single transfected receptor cDNA. 
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All publications and patent applications herein are 
incorporated by reference to the same extent as if each 
individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 
The invention now being fully described , it will be apparent to 
one of ordinary skill in the art that many changes and 
modifications can be made thereto without departing from the 
spirit or scope of the appended claims. 
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GGAGGGGGTG ACTGTCCAGA GCCTGGAACT GTGCCCACAC CAGAAGCCAT CAGCAGCAAG IBO 

GACACC ATG CGG CTT CCG GGT GCG ATG CCA GCT CTG GCC CTC AAA GGC 228 
Met Arg Leu Pro Gly Ala Met Pro Ala Leu Ala Leu Lys Gly 
1 5 10 

GAG CTG CTG TTG CTG TCT CTC CTG TTA CTT CTG GAA CCA CAG ATC TCT 276 
Glu Leu Leu Leu Leu Ser Leu Leu Leu Leu Leu Glu Pro Gin lie Ser 
15 20 25 30 

CAG GGC CTG GTC GTC ACA CCC CCG GGG CCA GAG CTT GTC CTC AAT GTC 324 
Gin Gly Leu Val Val Thr Pro Pro Gly Pro Glu Leu Val Leu Asn Val 
35 40 45 

TCC AGC ACC TTC GTT CTG ACC TGC TCG GGT TCA GCT CCG GTG GTG TGG 372 
Ser Ser Thr Phe Val Leu Thr Cys Ser Gly ser Ala Pro Val Val Trp 
50 55 60 

GAA CGG ATG TCC CAG GAG CCC CCA CAG GAA ATG GCC AAG GCC CAG GAT 420 
Glu Arg Met Ser Gin Glu Pro Pro Gin Glu Met Ala Lys Ala Gin Asp 

65 70 75 

GGC ACC TTC TCC ACC GTG CTC ACA CTG ACC AAC CTC ACT GCG CTA GAC 468 
Gly Thr Phe Ser Ser Val Leu Thr Leu Thr Asn Leu Thr Gly Leu Asp . 
80 85 90 

ACG GGA GAA TAC TTT TGC ACC CAC AAT GAC TCC CGT GGA CTG GAG ACC 516' 
Thr Gly Glu Tyr Phe cys Thr His Asn Asp ser Arg Gly Leu Glu Thr 
95 100 105 110 

GAT GAG CGG AAA CGG CTC TAC ATC TTT GTG CCA GAT CCC ACC GTG GGC 564 
Asp Glu Arg Lys Arg Leu Tyr lie Phe Val Pro Asp Pro Thr Val Gly 
115 120 125 

TTC CTC CCT AAT GAT GCC GAG GAA CTA TTC ATC TTT CTC ACG GAA ATA 612 
Phe Leu Pro Asn Asp Ala Glu Glu Leu Phe He Phe Leu Thr Glu He 
130 135 140 

ACT GAG ATC ACC ATT CCA TGC CGA GTA ACA GAC CCA CAG CTG GTG GTG 660 
Thr Glu He Thr He Pro Cys Arg Val Thr Asp Pro Gin Leu Val Val 
145 . 150 155 



ACA CTG CAC GAG AAG AAA GGG GAC GTT GCA CTG CCT GTC CCC TAT GAT 708 
Thr Leu His Glu Lys Lys Gly Asp Val Ala Leu Pro Val Pro Tyr Asp 
160 165 170 



CAC CAA CGT GGC TTT TCT GGT ATC TTT GAG GAC AGA AGC TAC ATC TGC 756 
His Gin Arg Gly Phe Ser Gly He Phe Glu Asp Arg Ser Tyr He Cys 
175 180 185 190 

AAA ACC ACC ATT GGG GAC AGG GAG GTG GAT TCT GAT GCC TAC TAT GTC . 804 
Lys Thr Thr lie Gly Asp Arg Glu Val Asp Ser Asp Ala Tyr Tyr Val 
195 200 205 

TAC AGA CTC CAG GTG TCA TCC ATC AAC GTC TCT GTG AAC GCA GTG CAG 852 
Tyr Arg Leu Gin Val Ser Ser He Asn Val Ser Val Asn Ala Val Gin 
210 215 220 

ACT GTG GTC CGC CAG GGT GAG AAC ATC ACC CTC ATG TGC ATT GTG ATC 900 
Thr Val Val Arg Gin Gly Glu Asn He Thr Leu Met cys He Val He 
225 230 235 

GGG AAT GAT GTG GTC AAC TTC GAG TGG ACA TAC CCC CGC AAA GAA AGT 948 
Gly Asn Asp Val Val Asn Phe Glu Trp Thr Tyr Pro Arg Lys Glu Ser 
240 245 250 

GGG CGG CTG GTG GAG CCG GTG ACT GAC TTC CTC TTG GAT ATG CCT TAC 996 
Gly Arg Leu Val Glu Pro Val- Thr Asp Phe Leu Leu Asp Met Pro Tyr 
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His lie Arg Ser lie Leu His lie Pro Ser Ala Glu Leu Glu Asp Ser 

275 280 285 

GGG ACC TAC ACC TGC AAT GTG ACG GAG AGT GTG AAT GAC CAT CAG GAT 1092 
Gly Thr Tyr Thr cys Asn Val Thr Glu Ser Val Asn Asp His Gin Asp 
290 295 300 

GAA AAG GCC ATC AAC ATC ACC GTG GTT GAG AGC GGC TAC GTG CGG CTC 1140 
Siu Lys Ala lie Asn He Thr Val val Glu Ser Gly Tyr Val Arg Leu 
305 310 315 

CTG GGA GAG GTG GGC ACA CTA CAA TTT GCT GAG CTG CAT CGG AGC CGG 1188 
Liu Gly Glu Val Gly Thr Leu Gin Phe Ala Glu Leu His Arg ser Arg 
320 * 325 330 

ACA CTG CAG GTA GTG TTC GAG GCC TAC CCA CCG CCC ACT GTC CTG TGG 1236 
Thr Leu Gin Val Val Phe Glu Ala Tyr Pro Pro Pro Thr Val Leu Trp 
335 340 345 350 

TTC AAA GAC AAC CGC ACC CTG GGC GAC TCC AGC GCT GGC GAA ATC GCC 1284 
Phe Lys Asp Asn Arg Thr Leu Gly Asp Ser Ser Ala Gly Glu lie Ala 
355 360 365 

CTG TCC ACG CGC AAC GTG TCG GAG ACC CCG TAT GTG TCA GAG CTG ACA 1332 

Leu Ser Thr Arg Asn Val Ser Glu Thr Arg Tyr Val Ser Glu Leu Thr 
370 375 380 

. CTG GTT CGC GTG AAG GTG CCA GAG GCT GGC CAC TAC ACC ATG CGG GCC 1380 
teu Val Arg Val Lys Val Ala Glu Ala Gly His Tyr Thr Met Arg Ala 
385 390 395 

TTC CAT GAG GAT GCT GAG GTC CAG CTC TCC TTC CAG CTA CAG ATC AAT 1428 
Phe His Glu Asp Ala Glu Val Gin Leu Ser Phe Gin Leu Gin He Asn 
400 405 410 

GTC CCT GTC CGA GTG CTG GAG CXA AGT GAG AGC CAC CCT GAC AGT GGG 1476 
val Pro Val Arg Val Leu Glu Leu ser Glu Ser His Pro Asp Ser Gly 
415 420 425 430 

GAA CAG ACA GTC CGC TGT CGT GGC CGG GGC ATG CCG CAG CCG AAC ATC 1524 
Glu Gin Thr Val Arg cys Arg Gly Arg Gly Met Pro Gin Pro Asn lie 
435 440 445 

ATC TGG TCT GCC TGC AGA GAC CTC AAA AGG TGT CCA CGT GAG CTG CCG 1572 
He Trp Ser Ala cys Arg Asp Leu Lys Arg Cys Pro Arg Glu Leu Pro 

450 455 460 

CCC ACG CTG CTG GGG AAC AGT TCC GAA GAG GAG AGC CAG CTG GAG ACT 1620 
Pro Thr Leu Leu Gly Asn Ser Ser Glu Glu Glu Ser Gin Leu Glu Thr 
465 470 475 

AAC GTG ACG TAC TGG GAG GAG GAG CAG GAG TTT GAG GTG GTG AGC ACA 1668 
Asn Val Thr Tyr Trp Glu Glu Glu Gin Glu Phe Glu Val Val Ser Thr 
480 485 .490 

CTG CGT CTG CAG CAC GTG GAT CGG CCA CTG TCG GTG CGC TGC ACG CTG 1716 
Leu Arg Leu Gin His Val Asp Arg Pro Leu Ser Val Arg Cys Thr Leu 
495 500 505 510 

CGC AAC GCT GTG GGC CAG GAC ACG CAG GAG GTC ATC GTG GTG CCA CAC 1764 
Arg Asn Ala Val Gly Gin Asp Thr Gin Glu Val He Val Val Pro His 
515 520 525 

TCC TTG CCC TTT AAG GTG GTG GTG ATC TCA GCC ATC CTG GCC CTG GTG 1312 
Ser Leu Pro Phe Lys Val Val Val He Ser Ala He Leu Ala Leu Val 
530 535 540 

GTG CTC ACC ATC ATC TCC CTT ATC ATC CTC ATC ATG CTT TGG CAG AAG 1860 
Val Leu Thr He He Ser Leu He He Leu He Met Leu Trp Gin Lys 
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560 565 570 

GAC GGC CAT GAG TAC ATC TAC GTG GAC CCC ATG CAG CTG CCC TAT GAC . 1956 

Asp Gly His Glu Tyr He Tyr Val Asp Pro Met Gin Leu Pro Tyr "Asp 
575 580 585 590 

TCC ACG TGG GAG CTG CCG CGG GAC CAG CTT GTG CTG GGA CGC ACC CTC 2004 
Ser Thr Tip Glu Leu Pro Arg Asp Gin Leu Val Leu Gly Arg Thr Leu 
595 600 605 

GGC TCT GGG GCC TTT GGG CAG GTG GTG CAG GCC ACA GCT CAT GGT CTG 2052 
G1Y Ser Gly Ala Phe Gly Gin Val Val Glu Ala Thr Ala His Gly Leu 
1 610 615 620 

AGC CAT TCT CAG GCC ACG ATG AAA GTG GCC GTC AAG ATG CTT AAA TCC 2100 
Ser His Ser Gin Ala Thr Met Lys Val Ala Val Lye Met Leu Lys Ser 
625 630 635 

ACA GCC CGC AGC ACT GAG AAG CAA GCC CTT ATG TCG GAG CTG AAG ATC 2148 
Thr Ala Arg Ser Ser Glu Lys Gin Ala Leu Met Ser Glu Leu Lys He 
640 645 650 

ATG" AGT CAC CTT GGG CCC CAC CTG AAC GTG GTC AAC CTG TTG GGG GCC 2196 
Met Ser His Leu Gly Pro His Leu Asn Val Val Asn Leu Leu Gly Ala 

655 660 665 670 

TGC ACC AAA GGA GGA CCC ATC TAT ATC ATC ACT GAG TAC TGC CGC TAC 2244 
Cys Thr Lys Gly Gly Pro He Tyr He He Thr Glu Tyr Cys Arg Tyr 
675 680 685 

GGA GAC CTG GTG GAC TAC CTG CAC CGC AAC AAA CAC ACC TTC CTG CAG 2292 
Gly Asp Leu Val Asp Tyr Leu His Arg Asn Lys His Thr Phe Leu Gin 
1 690 695 700 

CAC CAC TCC GAC AAG CGC CGC CCG CCC AGC GCG GAG CTC TAC AGC AAT 2340 
His -His Ser Asp Lys Arg Arg Pro Pro Ser Ala Glu Leu Tyr Ser Asn 
705 710 715 

GCT CTG CCC GTT GGG CTC CCC CTG CCC AGC CAT GTG TCC TTG ACC GGG 2388 
Ala Leu Pro Val Gly Leu Pro Leu Pro Ser His Val Ser Leu Thr Gly 
720 725 730 

GAG AGC GAC GGT GGC TAC ATG GAC ATG AGC AAG GAC GAG TCG GTG GAC 2436 
Glu Ser Asp Gly Gly Tyr Met Asp Met Ser Lys Asp Glu Ser Val Asp 
735 740 745 750 

TAT GTG CCC ATG CTG GAC ATG AAA GGA GAC GTC AAA TAT GCA GAC ATC 2484 

Tyr val Pro Met Leu Asp Met Lys Gly Asp val Lys Tyr Ala Asp He 
755 760 765 

GAG TCC TCC AAC TAC ATG GCC CCT TAC GAT AAC TAC GTT CCC TCT GCC 2532 
Glu Ser Ser Asn Tyr Met Ala Pro Tyr Asp Asn Tyr Val Pro Ser Ala 
770 " 775 780 

CCT GAG AGG ACC TGC CGA GCA ACT TTG ATC AAC GAG TCT CCA GTG CTA 2580 
Pro Glu Arg Thr Cys Arg Ala Thr Leu He Asn Glu Ser Pro Val Leu 
785 790 795 

AGC TAC ATG GAC CTC GTG GGC TTC AGC TAC CAG GTG GCC AAT GGC ATG 2628 
Ser Tyr Met Asp Leu Val Gly Phe Ser Tyr Gin Val Ala Asn Gly Mat 
800 805 810 

GAG TTT CTG GCC TCC AAG AAC TGC GTC CAC AGA GAC CTG GCG GCT AGG 2676 

Glu Phe Leu Ala Ser Lys Asn Cys Val His Arg Asp Leu Ala Ala Arg 
815 820 825 830 

AAC GTG CTC ATC TGT GAA GGC AAG CTG GTC AAG ATC TGT GAC TTT GGC 2724 
Asn Val Leu He Cys Glu Gly Lys Leu *Val Lys He Cys Asp Phe Gly 
835 840 845 
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ACC TTT TTG CCT TTA AAG TGG ATG GCT CCG GAG AGC ATC TTC AAC AGC 
£u p£ Leu Lys Trp Met Ala Pro Glu Ser lie Phe Asn Ser 

B65 570 o/a 

S|.SSS sg a s is s g s S 2 3 

gsassgSsssssgssssg 

GAG CAG TTC TAC AAT GCC ATC AAA CGG GGT TAC CGC ATG GCC CAG CCT 
Glu Gin Pee Tyr Asn Ala lie Lya Arg Gly Tyr Arg Met Ala Gin Pro 
915 920 *r a 

GCC CAT GCC TCC GAC GAG ATC TAT GAG ATC ATG CMS AAG TGC TGG GAA 
Ala His Ala ier Aap Glu. He Tyr Glu lie Met Gin Ly. cy. Trp Glu 
930 935 

GAG AAG TTT GAC ATT CGC CCC CCC TTC TCC CAG CTG CTG CTG CTT CTC 
Glu Lys Phe Glu lie Arg Pro Pro Phe Ser Gin Leu Val Leu Leu Leu 
945 950 SSD 

CAG CAG GTG GAT 
Gin Gin Val Asp 



GAG AGA CTG TTG GGC GAA GGT TAC AAA AAG AAG TAC 
Glu Arg Leu Leu Gly Glu Gly Tyr Lys Lys Lys Tyr 

960 965 970 

GAG GAG TTT CTG AGC AGT GAC CAC CCA GCC ATC CTT 
Glu Glu Phe Leu Arg Ser Asp His Pro Ala lie Leu 
975 9*0 985 

CCC TTG CCT GGC TTC CAT GGC CTC CGA TCT CCC CTG 
Arg Leu Pro Gly Phe His Gly Leu Arg Ser Pro Leu 
995 1000 



CGG TCC CAC GCC 
Arg Ser Gin Ala 

990 

GAC ACC AGC TCC 
Asp Thr Ser Ser 
1005 



GTC CTC TAT ACT GCC GTG CAG CCC AAT GAG GGT GAC AAC GAC TAT ATC 
Val Leu Tyr Val Gln pro ££ 5 qlU Gly P 1020 

ATC CCC CTG CCT GAC CCC AAA CCT GAG GTT GCT GAC GAG GGC CCA CTG 
lie Pro Leu Pro Asp Pro Lys Pro Glu Val Ala Asp Glu Gly Pro Leu 
1025 1030 

GAG GGT TCC CCC AGC CTA GCC AGC TCC ACC CTG AAT GAA GTC AAC ACC 
Glu Gly Ser Pro Ser Leu Ala Ser Ser Thr Leu Asn Glu val Asn xnr 
1040 1045 1050 

TCC TCA ACC ATC TCC TGT GAC AGC CCC CTG GAG CCC CAG GAC GAA CCA 
sir Ser Thr He Ser cys Asp Ser Pro Leu Glu Pro Gin Asp Glu Pro 
X055 1060 1065 10 /u 

GAG CCA GAG CCC CAG CTT GAG CTC CAG GTG GAG CCG GAG CCG GAG CTG 
Glu Pro Glu Pro Gin Leu Glu Leu Gin Val Glu Pro Glu Pro Glu Leu 
1075 1080 1UIJD 

GAA CAG TTG CCG GAT TCG GGC TGC CCT GCG CCT CGG GCG GAA GCA GAG 

SS Gin Leu Pro Asp Ser Gly Cys Pro Ala Pro Arg Ala Glu Ala Glu 
1090 1095 liOU 

GAT AGC TTC CTG TACCGCGCTG GCCCCTACCC TGCCCTGCCT GAAGCTCCCC 
Asp Ser Phe Leu 
1105 

. CGCTGCCAGC ACCCAGCATC TCCTGGCCTG GCCTGGCCGG GCTTCCTGTC AGCCAGGCTG 
CCCTTATCAG CTGTCCCCTT CTGGAAGCTT TCTGCTCCTG ACGTGTTGTG CCCCAAACCC 



2820 

2868 

2916 

2964 

3012 

3O60 

310B 

3156 

3204 

3252 

3300 

3348 

3396 

3444 

3492 

3544 

3604 
3664 
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TGGGAAAGTT AGGCTTGATG ACCCAGAATC TAGGATTCTC TCCCTGGCTG ACAGGTGGGG 3B44 

AGACCGAATC CCTCCCTGGG AAGATTCTTG GAGTTACTGA GGTGGTAAAT TAACTTTTTT 3904 

CT C TTCAGCC AGCTACCCCT CAAGGAATCA TAGCTCTCTC CTCGCACTTT TATCCACCCA 3964 

GGAGCTAGGG AAGAGACCCT AGCCTCCCTG GCTGCTGGCT GAGCTAGGGC CTAGCCTTGA 4024 

GCAGTGTTGC CTCATCCAGA AGAAAGCCAG TCTCCTCCCT ATGATGCCAG TCCCTGCGTT 4084 

CCCTGGCCCC AGCTGGTCTG GGCCCATTAG GCAGCCTAAT TAATGCTGGA GGCTGAGCCA ' • 4144 

AGTACAGGAC ACCCCCAGCC TGCAGCCCTT GCCCAGGGCA CTTGGAGCAC ACGCAGCCAT 4204 

AGCAAGTGCC T G T G T CC CT G TCCTTCAGGC CCATCACTCC TGGGGCTTTT TCTTTATCAC 4264 

.CCTCAGTCTT AATCCATCCA CCAGAGTCTA GAAGGCCAGA CGGGCCCC6C ATCTGTGATG 4 324 

AGAATGTAAA TGTGCCAGTG TGGAGTGGCC ACGTGTGTGT GCCAGATATG GCCCTGGCTC 4384 

TGCATTGGAC CTGCTATGAG GCTTTGGAGG AATCCCTCAC CCTCTCTGGG CCTCAGTTTC 4444 

CCCTTCAAAA AATGAATAAG TCGGACTTAT TAACTCTGAQ TGCCTTGCCA GCACTAACAT 4 504 

TCTAGAGTAT CCAGGTGGTT GCACATTTGT CCAGATGAAG CAAGGCCATA TACCCTAAAC 4 564 

TTCCATCCTG GGGGTCAGCT GGGCTCCTGG GAGATTCCAG ATCACACATC ACACTCTGGG 4624 

GACTCAGGAA CCATGCCCCT TCCCCAGGCC CCCAGCAAGT CTCAAGAACA CAGCTGCACA 4684 

GGCCTTGACT TAGAGTGACA GCCGGTGTCC TGGAAAGCCC CCAGCAGCTG CCCCAGGGAC 4744 

ATGGGAAGAC CACGGGACCT CTTTCACTAC CCACGATGAC CTCCGGGCGT ATCCTGGGCA 4804 

AAAGGGACAA AGAGGGCAAA TGAGATCACC TCCTGCAGCC CACCACTCCA GCACCTGTGC 4864 

CGAGGTCTGC GTCGAAGACA GAATGGACAG TGAGGACAGT TATCTCTTGT AAAAGACAAG 4924 

AAGCTTCAGA TGGGTACCCC AAGAAGGATG TGAGAGGTGG GCGCZTTGGA GGTTTGCCCC 49 B4 

TCACCCACCA GCTGCCCCAT CCCTGAGGCA GCGCTCCATG GGGGTATGGT TTT.GTCACTG 5044 

CCCAGACCTA GCAGTGACAT CTCATTGTCC CCAGCCCAGT GGGCATTGGA GGTGCCAGGG 5104 

GAGTCAGGGT TGTAGCCAAG ACGCCCCCGC ACGGGGAGGG TTGGGAAGGG GGTGCAGGAA 5164 

GCTCAACCCC TCTGGGCACC AACCCTGCAT TGCAGGTTGG CACCTTACTT CCCTGGGATC 5224 

CCAGAGTTGG TCCAAGGAGG GAGAGTGGGT TCTCAATACG GTACCAAAGA TATAATCACC 5284 

TAGGTTTACA AATATTTTTA GGACTCACGT TAACTCACAT TTATACAGCA GAAATGCTAT 5344 

TTTGTATGCT GTTAAfcTTTT TCTATCTGTG TACTTTTTTT TAAGGGAAAG ATTTTAATAT 5404 

TAAACCTGGT GCTTCTCACT CAC ' 5427 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1X06 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
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20 



'is " 40 

Thr Phe Val Leu Thr cys Ser G!y Ser Ala Pro Val Val Trp Glu Arg 



:M val val Thr Pro Pro Gly Pro Glu L«u Val Leu Asn Val Ser Ser 

ser Gly Ser Ala Pro Val 
50 55 60 

Met Ser Gin Glu Pro Pro Gin Glu Met Ala Lys Ala Gin Asp Gly Thr 
65 70 75 

Phe Ser Ser Val Leu Thr Leu Thr Asn Leu Thr Gly Leu Asp Thr Gly 
85 

Glu Tyr Ph. Cys Thr His Asn Asp ser Arg Gly Leu Glu Thr Asp Glu 
100 i05 

Ax? Lys Arg Leu Tyr He Phe Val Pro Asp Pro Thr Val Gly Phe Leu 



Pro Asn Asp Ala Glu Glu leu Phe Xle Phe Leu Thr Glu He Thr Glu 

He Thr lie Pro cys Arg Val Thr Asp Pro Gin Leu Val Val Thr Leu 
^45 150 A3» 

His Glu Lys Lys Gly Asp Val Ala Leu Pro val Pro Tyr Asp His Gin 

Arg Gly Phe Ser Gly He Phe Glu Asp Arg Ser Tyr He Cys Lys Thr 
1B0 185 

Thr He Gly Asp Arg Glu val Asp Ser Asp Ala Tyr Tyr val Tyr Arg 

195 200 * v 

L.u Gin val Ser Ser He Asn Val Ser Val Asn Ala Val Gin Thr val 

210 ^15 

val Arg Gin Gly Glu Asn He Thr Leu Met cys He Val He Gly Asn 
225 ■ 230 23 

Asp val Val Asn Phe Glu Trp Thr Tyr Pro Arg Lys Glu Ser Gly Arg 

245 250 

Leu Val Glu Pro Val Thr Asp Phe Leu Leu Asp Met Pro Tyr His He 

260 265 
Arg ser He Leu His He Pro Ser Ala Glu Leu Glu Asp Ser Gly Thr 

Tyr Thr Cys Asn Val Thr Glu Ser Val Asn Asp His Gin Asp Glu Lys 

Ala He Asn He Thr Val Val Glu Ser Gly Tyr Val Arg Leu Leu Gly 
305 31° 31S 

Glu Val Gly Thr Leu Gin Phe. Ala Glu Leu His Arg Ser Arg Thr Leu 
325 33u 

Gin val val Phe Glu Ala Tyr Pro Pro Pro Thr Val Leu Trp Phe Lys 
340 3*5 J3U 

Asp Asn Arg Thr Leu Gly Asp Ser Ser Ala Gly Glu He Ala Leu Ser 
F 355 360 • Joa 

Thr Arg Asn val Ser Glu Thr Arg Tyr Val. Ser Glu Leu Thr Leu Val 
370 375 380 

Arg Val Lys Val Ala Glu Ala Gly His Tyr Thr Met Arg Ala Phe His 
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Val Arg val Uu Glu Leu Ser Glu Ser His Pro Asp Ser Gly Glu Gin 
420 425 430 

Thr Val Arg Cys Arg Gly Arg Gly Met Pro Gin Pro Asn lie He Trp 
435 440 445 

Ser Ala Cys Arg Asp Leu Lys Arg Cys Pro Arg Glu Leu Pro Pro Thr 
450 455 460 

Leu Leu Gly Asn Ser Ser Glu Glu Glu Ser Gin Leu Glu Thr Asn Val 
465 470 475 480 

Thr Tyr Trp Glu Glu Glu Gin Glu Phe Glu Val Val Ser Thr Leu Arg 
485 490 495 

Leu Gin His Val Asp Arg Pro Leu Ser Val Arg Cys Thr Leu Arg Asn 
500 505 510 

Ala Val Gly Gin Asp Thr Gin Glu Val He Val Val Pro His Ser Leu 
515 520 525 

Pro Phe Lys Val Val Val He Ser Ala He Leu Ala Leu Val Val Leu 
530 535 540 

Thr He He Ser Leu He He Leu He Met Leu Trp Gin Lys Lys Pro 
545 550 555 560" 

Arg Tyr Glu -He Arg Trp Lys Val He Glu Ser Val Ser Ser Asp Gly 
565 570 575 

His Glu Tyr He Tyr Val Asp Pro Met Gin Leu Pro Tyr Asp Ser Thr 
580 585 590 

Trp Glu Leu Pro Arg Asp Gin Leu Val Leu Gly Arg Thr Leu Gly Ser 
595 600 605 

Gly . Ala Phe Gly Gin Val Val Glu Ala Thr Ala His Gly Leu Ser His 
610 615 620 

Ser Gin Ala Thr Met Lys Val Ala Val Lys Met Leu Lys Ser Thr Ala 
625 630 . 635 640 

Arg Ser Ser Glu Lys Gin Ala Leu Met Ser Glu Leu Lys He Met Ser 
645 650 655 

His Leu Gly Pro His Leu Asn Val Val Asn Leu Leu Gly Ala Cys Thr 
660 665 670 

Lys Gly Gly Pro He Tyr He He Thr Glu Tyr Cys Arg Tyr Gly Asp 
675 680 685 

Leu Val Asp Tyr Leu His Arg Asn Lys His Thr Phe Leu Gin His His 
690 695 700 

. Ser Asp Lys Arg Arg Pro Pro Ser Ala Glu Leu Tyr Ser Asn Ala Leu 
705 710 715 720 

Pro Val Gly Leu Pro Leu Pro ser His val Ser Leu Thr Gly Glu Ser 
725. 730 735 

Asp Gly Gly Tyr Met Asp Met Ser Lys Asp Glu Ser Val Asp Tyr Val 
740 .745 750 

Pro Met Leu Asp Met Lys Gly Asp Val Lys Tyr Ala Asp He Glu Ser 
755 760 765 

Ser Asn Tyr Met Ala Pro Tyr Asp Asn Tyr Val Pro Ser Ala Pro Glu 
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Met Asp Leu Val Gly Phe Ser Tyr Gin Val Ala Asn Gly Met Glu. phe 

1 nr\r filfl B15 




Leu Ala Ser Lys Asn cys Val His Arg Asp Leu Ala Ala Arg Asn Val 
820 825 830 

Leu He Cys Glu Gly Lys Leu Val Lys He Cys Asp Phe Gly Leu Ala 
835 8*° 845 

Arg Asp He Met Arg Asp Ser Asn Tyr He Ser Lys Gly Ser Thr Phe 
850 855 860 

Leu Pro Leu Lys Trp Met Ala Pro Glu Ser He Phe Asn Ser Leu Tyr 
865 870 875 880 

Thr Thr Leu Ser Asp Val Trp Ser Phe Gly He Leu Leu Trp Glu He 
885 890 895 

Phe Thr Leu Gly Gly Thr Pro Tyr Pro Glu Leu Pro Met Asn Glu Gin 
900 905 910 

Phe Tyr Asn Ala He Lys Arg Gly Tyr Arg Met Ala Gin Pro Ala His 
J 915 920 925 

Ala Ser Asp Glu He Tyr Glu He Met Gin Lys Cys Trp Glu Glu Lys 
930 935 940 

?he Glu He Arg Pro Pro Phe Ser Gin Leu Val Leu Leu Leu Glu Arg 
945 950 955 960 

Leu Leu Gly Glu Gly Tyr Lys Lys Lys Tyr Gin Gin val Asp Glu Glu 
965 970 975 

Phe Leu Arg Ser Asp His Pro Ala He Leu Arg Ser Gin Ala Arg Leu 
980 985 990 

Pro Gly Phe His Gly Leu Arg Ser Pro Leu Asp Thr Sex Ser val Leu 
995 " 1000 1005 

Tyr Thr Ala Val Gin Pro Asn Glu Gly Asp Asn Asp Tyr He He Pro 
1010 1015 1020 

Leu Pro Asp Pro Lys Pro Glu val Ala Asp Glu Gly Pro Leu Glu Gly 
1C25 1030 1035 104C 

Ser Pro Ser Leu Ala Ser Ser Thr Leu Asn Glu Val Asn Thr Ser Ser 
1045 1050 1055. 

Thr He Ser Cys Asp Ser Pro Lev Glu Pro Gin Asp Glu Pro Glu Pro 
1060 1065 1070 

Glu Pro Gin Leu Glu lieu Gin Val Glu Pro Glu Pro Glu Leu Glu Gin 
1075 1080 1085 

Leu Pro Asp Ser Gly Cys Pro Ala Pro Arg Ala Glu Ala Glu Asp Ser 
1090 . 1095 1100 

Phe Leu 
1105 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4100 base pairs 

(B) TYPE: nucleic acid* 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE; 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtio 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 129.. 3395 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTGGAGCTAC AGGGAGAGAA ACAGAGGAGG AGACTGCAAG AGATCATTGG AGGCCGTGGG 60 

CACGCTCTTT ACTCCATGTG TGCGACATTC ATTGCGGAAT AACATCGGAG GAGAAGTTTC 120 

CCAGAGCT ATG CGC ACT TCC CAT CCG GCC TTC CTG GTC TTA GCC TGT CTT 170 
net ciy Thr Scr His Pro Ala Phe Leu val Leu Gly cys Leu 

■ 1 5 10 

J££ Sf 5 F G AGC CTA ATC CTC TGC CAG CTT TCA TTA CCC TCT ATC ,i ft 
Uu Thr Gly Leu Ser l*u lie Leu Cys Gin SS s£ Hu I" lie 218 
20 25 30 



SBBaassiSBaaassssg - 
as sk ss ss is g ss a s % k si ai as as is » 

D 70 75 

Sy Se v2 Thr 55 H G S 1 ? AGC AGT CGC TCG GCG GCC CAC 410 

y an. 1 ^2 Glu Val Ser s « r Ala Ser Ala Ala His 

85 90 

Thr Gly SS £r tS 25* ™ C ** C <* C CAC ACA GAA GAG AAT 

Thr ciy Leu Tyr Thr Cys Tyr Tyr Asn His Thr Gin Thr Glu Glu Asn 



105 no 



SS SS Glu Gly tTI £E A ? C * AT GTG cca cca gta 

&AU Gly ~fg His Ile Tyr lie Tyr Val Pro Asp Pro Asp Val 
115 120 125 



iso r 155 

52 Thr SS Sis SS S 6 GTG CCT GCC TCC ™c GAC AGC 

Thr leu His Asn Ser Glu Gly Val Val Pro Ala Ser Tyr Asp Ser 
165 i7o 



458 



506 



. 554 



52 SS Val pS SS IS 2*5 GAT TAT TTA GTC ATC GTG GAG GAT 

Ala Phe val Prp Leu Gly Met Thr Asp Tyr Leu Val He Val Glu Asp 

135 140 

ASP Asp S« Ala iS fit S CT £? T CGC ACA ACT GAT CCC GAG ACT CCT • 602 

Asp Asp Ser Ala He He Pro Cys Arg Thr Thr Asp Pro Glu Thr Pro 



650 



sgssjsjsjssaisssssssS «• 
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ACC GTG TAT AAC TCA GGG GAA ACG ATT GTC GTC ACC TGT GCT GTT TTT 842 
Thr Val Tyr Lys Ser Gly Glu Thr lie Val Val Thr cys Ala val Phe 
225 230 235 

AAC AAT GAG GTG GTT GAC CTT CAA TGG ACT TAC CCT GGA GAA GTG AAA 890 

Asn Asn Glu val Val Asp Leu Gin Trp Thr Tyr Pro Gly Glu val Lys 
240 245 250 

GGC AAA GGC ATC ACA ATG CTG GAA GAA ATC AAA GTC CCA TCC ATC AAA 938 
Gly Lys Gly lie Thr~Met Leu Glu Glu lie Lys Val Pro Ser He Lys 
255 260 265 270 

TTG GTG TAC ACT TTG ACG GTC CCC GAG GCC ACG GTG AAA GAC AGT GGA 986 
Ltu Val Tyr Thr Leu Thr Val Pro Glu Ala Thr val Lys Asp ser Gly 
275 280 285 

GAT TAC GAA TGT GCT GCC CGC CAG GCT ACC AGG GAG GTC AAA GAA ATG 1034 
Asp Tyr Glu cys Ala Ala Arg Gin Ala Thr Arg Glu Val Lys Glu Met 
. 290 295 300 

AAG AAA GTC ACT ATT TCT GTC CAT GAG AAA GGT TTC ATT GAA ATC AAA 1082 
Lys Lys Val Thr He Ser Val His Glu Lys Gly Phe He Glu He Lys 
305 310 315 

CCC ACC TTC AGC CAG TTG GAA GCT GTC AAC CTG CAT GAA GTC AAA CAT 1130 
Pro Thr Phe Ser Gin Leu Glu Ala Val Asn Leu His Glu Val Lys His 
320 325 330 

TTT GTT GTA GAG GTG CGG GCC TAC CCA CCT CCC AGG ATA TCC TGG CTG 1178 
Phe Val Val Glu Val Arg Ala Tyr Pro Pro Pro Arg He Ser Trp Leu 
335 340 345 350 

AAA AAC AAT CTG ACT CTG ATT GAA AAT CTC ACT GAG ATC ACC ACT GAT 1226 
Lys Asn Asn Leu Thr Leu He Glu Asn Leu Thr Glu He Thr Thr Asp 
355 360 365 

GTG GAA AAG ATT CAG GAA ATA AGG TAT CGA AGC AAA TTA AAG CTG ATC 1274 
Val Glu Lys He Gin Glu He Arg Tyr Arg Ser Lys Leu Lys Leu He 
370 375 380 

CGT GCT AAG GAA GAA GAC AGT GGC CAT TAT ACT ATT GTA GCT CAA AAT 1322 
Arg Ala Lys Glu Glu Asp Ser Gly His Tyr Thr He Val Ala Gin Asn 
385 390 395 

GAA GAT GCT GTG AAG AGC TAT ACT TTT GAA CTG TTA ACT CAA GTT CCT 1370 
Glu Asp Ala Val Lys Ser Tyr Thr Phe Glu Leu Leu Thr Gin Val Pro 
400 405 410 

TCA TCC ATT CTG GAC TTG GTC GAT GAT CAC CAT GGC TCA ACT GGG GGA 1418 
Ser Ser . He Leu Asp Leu Val Asp Asp His His Gly Ser Thr Gly Gly 
.415 420 425 430 

CAG ACG GTG AGG TGC ACA GCT GAA GGC ACG CCG CTT CCT GAT ATT GAG 1466 
Gin Thr Val Arg Cys Thr Ala Glu Gly Thr Pro Leu Pro Asp He Glu 
435 440 445 

TGG ATG ATA TGC AAA GAT ATT AAG AAA TGT AAT AAT GAA ACT TCC TGG 1514 
Trp Met. He Cys Lys Asp He Lys. Lys Cys Asn Asn Glu Thr Ser Trp 
450 455 460 

ACT AIT TTG GCC AAC AAT GTC TCA AAC ATC ATC ACG GAG ATC CAC TCC 1562 
Thr He Leu Ala Asn Asn Val Ser Asn He He Thr Glu He His Ser 
465 470 475 

CGA GAC AGG AGT ACC GTG GAG GGC CGT GTG ACT TTC GCC AAA GTG GAG 1610 

Arg Asp Arg Ser Thr Val Glu Gly Arg Val Thr Phe Ala Lys Val Glu 

480 485 490 
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AAC CGA GAG CTG AAG CTG GTG GCT CCC ACC CTG CGT TCT GAA CTC ACG 1706 
Asn Arg Glu Leu Lys Leu Val Ala Pro Thr Leu Arg ser Glu Leu Thr 
515 520 525 



GTG GCT GCT GCA GTC CTG GTG CTG TTG GTG ATT GTG ATC ATC TCA CTT 1754 
Val Ala Ala Ala Val Leu Val Leu Leu Val lie Val lie lie Ser Leu 
. 530 535 540 

ATT GTC CTG GTT GTC ATT TGG AAA CAG AAA CCG AGG TAT GAA ATT CGC 1802 
lie Val leu Val Val lie Trp Lys Gin Lys Pro Arg Tyr Glu He Arg 
545 550 555 

TGG AGG GTC ATT GAA TCA ATC AGC CCA GAT GGA CAT GAA TAT ATT TAT 1850 

Trp Arg Val lie Glu Ser He Ser Pro Asp Gly His Glu Tyr He Tyr 

560 565 570 

GTG GAC CCG ATG CAG CTG CCT TAT GAC TCA AGA TGG GAG TTT CCA AGA 1B98 

Val Asp Pro Met Gin Leu Pro Tyr Asp Ser Arg Trp Glu Phe Pro Arg 

575. 580 585 . 590 

GAT GGA CTA GTG CTT GGT CGG GTC TTG GGG TCT GGA GCG TTT GGG AAG 1946 
Asp Gly Leu Val Leu Gly Arg Val Leu Gly ser Gly Ala Phe Gly Lys 
595 600 605 

GTG GTT GAA GGA ACA GCC TAT GGA TTA AGC CGG TCC CAA CCT GTC ATG 1994 
Val Val Glu Gly Thr Ala Tyr Gly Leu Ser Arg Ser Gin Pro Val Met 
610 615 620 

AAA GTT GCA GTG AAG ATG CTA AAA CCC ACG GCC AGA TCC AGT GAA AAA 2042 
Lys Val Ala Val Lys Met Leu Lys Pro Thr Ala Arg Ser Ser Glu Lys 
625 630 635 

CAA GCT CTC ATG TCT GAA CTG AAG ATA ATG ACT CAC CTG GGG CCA CAT 2090 

Gin Ala Leu Met Ser Glu Leu Lys He Het Thr His Leu Gly Pro His 

640 645 650 

TTG AAC ATT GTA AAC TTG CTG GGA GCC TGC ACC AAG TCA GGC CCC ATT 2138 
Leu Asn lie Val Asn Leu Leu Gly Ala Cys Thr Lys Ser Gly Pro He 
655 660 " 665 670 

TAC ATC ATC ACA GAG TAT TGC TTC TAT GGA GAT TTG GTC AAC TAT TTG 2186 
Tyr He He Thr Glu Tyr cys Phe Tyr Gly Asp Leu Val Asn Tyr Leu 

675 680 685 

CAT AAG AAT AGG GAT AGC TTC CTG AGC CAC CAC CCA GAG AAG CCA AAG 2234 
His Lys Asn Arg Asp Ser Phe Leu Ser His His Pro Glu Lys Pro Lys 
690 695 700 

AAA GAG CTG GAT ATC TTT GGA TTG AAC CCT GCT GAT GAA AGC ACA CGG 2282 
Lys Glu Leu Asp He Phe Gly Leu Asn Pro Ala Asp Glu Ser Thr Arg 
705 710 715 

AGC TAT GTT ATT TTA TCT TTT GAA AAC AAT GGT GAC TAC ATG GAC ATG 2330 
Ser Tyr Val He Leu Ser Phe Glu Asn Asn Gly Asp Tyr Met Asp Met 
720 725 730 

AAG CAG GCT GAT ACT ACA CAG TAT GTC CCC ATG CTA GAA AG« AAA GAG 2378 
Lys Gin Ala Asp Thr Thr Gin Tyr Val Pro Met Leu Glu Arg Lys Glu 
735 . 740 " 745 750 

GTT TCT AAA TAT TCC GAC ATC CAG AGA TCA CTC TAT GAT CGT CCA GCC 2426 
Val Ser Lys Tyr Ser Asp He Gin Arg Ser Leu Tyr Asp Arg Pro Ala 
755 760 765 

TCA TAT AAG AAG AAA TCT ATG TTA GAC TCA GAA GTC AAA AAC CTC CTT 2474 
Ser Tyr Lys Lys Lys Ser Met Leu Asp Ser Glu Val Lys Asn Leu Leu 
770 775 780 

TCA GAT GAT AAC TCA GAA GGC CTT ACT TTA TTG GAT TTG TTG AGC TTC 2522 
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Thr Tyr Gin Val Ala Arg Gly Met Glu Phe Leu Ala Ser Lys Asn cys 

800 805 810 

GTC CAC CGT GAT CTG GCT GCT CGC AAC GTT CTC CTG GCA CAX GGA AAA 
val His Arg Asp Leu Ala Ala Arg Asn Val Leu Leu Ala Gin Gly Lys 
815 820 825 830 

ATT GTG AAG ATC TGT GAC TTT GCC CTG GCC AGA GAC ATC ATG CAT GAT 
He Val Lys lie Cys Asp Phe Gly Leu Ala Arg Asp He Met His Asp 
835 840 8*5 

TCG AAC TAT GTG TCG AAA GGC AGT ACC TTT CTG CCC GTG AAG TGG .ATG 

ser Asn Tyr Val Ser Lys Gly Ser Thr Phe Leu Pro Val Lys Trp Met 
850 855 860 

GCT CCT GAG AGC ATC TTT GAC AAC CTC TAC ACC ACA CTG AGT GAT GTC 
Ala Pro Glu Ser He Phe Asp Asn Leu Tyr Thr Thr Leu Ser Asp Val 
865 870 875 

TGG TCP TAT GGC ATT CTG CTC TGG GAG ATC TTT TCC CTT GGT GGC ACC 
Trp Ser Tyr Gly lie Leu Leu Trp Glu He Phs Ser Leu Gly Gly Thr 
r 880 885 890 

CCT TAC CCC GGC ATG ATG GTG GAT TCT ACT TTC TAC AAT AAG ATC AAG 
Pro Tyr Pro Gly Met Met Val Asp Ser Thr Phe Tyr Asn Lys He Lys 
B95 900 905 910 

AGT GGG TAC CGG ATG GCC AAG CCT GAC CAC GCT ACC AGT GAA GTC TAC 
Ser Gly Tyr Arg Met Ala Lys Pro Asp His Ala Thr Ser Glu. Val Tyr 
915 920 925 

GAG ATC ATG GTG AAA TGC TGG AAC AGT GAG CCG GAG AAG AGA CCC TCC 
Glu He Met Val Lys Cys Trp Asn Ser Glu Pro Glu Lys Arg Pro Ser 
930 935 940 / 

TTT TAC CAC CTG AGT GAG ATT GTG GAG AAT CTG CTC CCT GGA CAA TAT 
Phe Tyr His Leu Ser Glu He Val Glu Asn Leu Leu Pro Gly Gin Tyr 
945 950 955 

AAA AAG AGT TAT GAA AAA ATT CAC CTG GAC TTC CTG AAG AGT GAC CAT 
Lys Lys Ser Tyr Glu Lys He His Leu Asp Phe Leu Lys Ser Asp His 
960 965 970 

CCT GCT GTG GCA CGC ATG CGT GTG GAC TCA GAC AAT GCA TAC ATT GGT 
Pro Ala Val Ala Arg Met Arg Val Asp Ser Asp Asn Ala Tyr He Gly 
975 980 985 $90 

. GTC ACC TAC AAA AAC GAG GAA GAC AAG CTG AAG GAC TGG GAG GGT GGT 
Val Thr Tyr Lys Asn, Glu Glu Asp Lys Leu Lys Asp Trp Glu Gly Gly 
995 1000 1005 

CTG GAT GAG CAG AGA CTG AGC GCT GAC AGT GGC TAC ATC ATT CCT CTG 
Leu Asp Glu Gin Arg Leu Ser Ala Asp Ser Gly Tyr lie He Pro Leu 
1010 1015 1020 

CCT GAC ATT GAC CCT GTC CCT GAG GAG GAG GAC CTG GGC AAG AGG AAC 
Pro Asp He Asp Pro Val. Pro Glu Glu Glu Asp Leu Gly Lys Arg Asn 
1025 1030 1035 

AGA CAC AGC TCG CAG ACC TCT GAA GAG AGT CCC ATT GAG ACG GGT TCC 
Arg His Ser Ser Gin Thr Ser Glu Glu Ser Ala He Glu Thr Gly Ser 
1040 1045 1050 

AGC AGT TCC ACC TTC ATC AAG AGA GAG GAC GAG ACC ATT GAA GAC ATC 
Ser Ser Ser Thr. Phe He Lys Arg Glu Asp Glu Thr He Glu Asp lie 
1055 1060 1065 1070 

GAC ATG ATG GAC GAC ATC GGC ATA GAC TCT TCA GAC CTG GTG GAA GAC 
Asp Met Met Asp Asp He Gly He Asp Ser Ser Asp Leu val Glu Asp 



2618 



2666 



2714 
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2810 



2858 
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3002 



3050 
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3146 



3194 



3242 
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Ser Phe Leu 

ACCTCTGGAT CCCCTTCACA AAACCACTTT ATTGCAATGC GGAGGTTOAG AGGAGGACTT 3495 

GCTTGATGTT TAAAGAGAAG TTCCCAGCCA AGGGCCTCGG GGAGCCTTTC TAAATATGAA 3555 

TGAATGGGAT ATTTTGAAAT GAACTTTGTC AGTGTTGCCT CTTGCAATGC CTCAGTAGCA 3 615 

TCTCACTCGT GTGTGAAGTT TGGAGATACA TGGATAAGGG AAXAATAGGC CACAGAAGGT 3675 

GAACTTTCTG CTTCAAGGAC ATTGGTGAGA GTCCAACAGA CACAATTTAT ACTGCGACAG 3735 

AACTTCAGCA TTGTAATTAT GTAAATAACT CTAACCACGG CTGTGTTTAG ATTGTATTAA 3795 

CTAT CTTCTT TGGACTTCTG AAGAGACCAC TCAATCCATC CATGTACTTC CCTCTTGAAA 3855 

CCTGATGTCA GCTGCTGT TG AACTTTTTAA AGAAGTGCAT GAAAAACCAT TTTTGACCTT 39X5 

AAAAGGTACT GGTACTATAG CATTTTGCTA T CTTTTTT AG TGTTAAAGAG ATAAAGAATA 3975 

ATAATTAACC AACCTTGTTT AATAGATTTG GCTCATTTAG AAGCCTGACA ACTCATTTTC 4035 

ATATTGTAAT CTATGTTTAT AATACTACTA CTGTTATCAG TAATGCTAAA TGTGTAATAA 4095 

TGTAA 4100 

(2) INFORMATION FOR SEQ ID NO: 4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Tbr Ser His Pro Ala Phe Leu Val Leu Gly Cys Leu Leu Thr 
1 5 10 15 

Gly Leu Ser Leu lie Leu Cys Gin Leu Ser Leu Pro Ser lie Leu Pro 
20 25 30 

Asn Glu Asn Glu Lys Val Val Gin Leu Asn Ser Ser Phe Ser Leu Arg 
35 40 45 

Cys Phe Gly Glu Ser Glu Val Ser Trp Gin Tyr Pro Met Ser Glu Glu 
50 55 60 

Glu Ser Ser Asp Val Glu lie Arg Asn Glu Glu Asn Asn Ser Gly Leu 
65 70 75 80 

Phe Val Thr Val Leu Glu Val Ser Ser Ala Ser Ala Ala His Thr Gly 
85 90 95 

Leu Tyr Thr cys Tyr Tyr Asn His Thr Gin Thr Glu Glu Asn Glu Leu 

100 105 110 

Glu Gly Arg His lie Tyr lie Tyr Val Pro Asp Pro Asp Val Ala Phe 
115 120. 125 

Val Pro Leu Gly Met Thr Asp Tyr Leu Val lie Val Glu Asp Asp Asp 
130 135 140 

Ser Ala He He Pro Cys Arg. Thr Thr Asp Pro Glu Thr Pro Val Thr 
145 150 • 155 160 
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Val Lys Gly Lys Lys Phe Gin Thr He Pro Phe Asn Val Tyr Ala Leu 
195 200 205 

Lys Ala Thr Ser Glu Leu Asp Leu Glu Met Glu Ala Leu Lys Thr Val 
210 215 220 

Tyr Lys ser Gly Glu Thr He Val Val Thr Cys Ala val Phe Asn Asn 
225 230 235 240 

Glu Val Val Asp Leu Gin Trp Thr Tyr Pro Gly Glu Val Lys Gly Lys 
245 250 255 

Gly He Thr Met Leu Glu Glu He Lys Val Pro Ser lie Lys Leu val 
260 265 270 

Tyr Thr Leu Thr Val Pro Glu Ala Thr Val Lys Asp Ser Gly Asp Tyr 
275 280 285 

Glu cys Ala Ala Arg Gin Ala Thr Arg Glu Val Lys Glu Met Lys Lys 
290 295 300 

Val Thr He Ser Val His Glu Lys Gly Phe He Glu He Lys Pro Thr 
305 310 315 320 

Phe Ser Gin Leu Glu Ala Val Asn Leu His Glu Val Lys His Phe Val 
325 330 335 

Val Glu Val Arg Ala Tyr Pro Pro Pro Arg He Ser Trp Leu- Lys Asn 
340 345 350 

Asn Leu Thr Leu He Glu Asn Leu Thr Glu He Thr Thr Asp Val Glu 
355 366 365 

Lys He Gin Glu He Arg Tyr Arg Ser Lys Leu Lys Leu He Arg Ala 
370 375 380 

Lys Glu Glu Asp ser Gly His Tyr Thr He Val Ala Gin Asn Glu Asp 
385 390 395 400 

Ala Val Lys Ser Tyr Thr Phe- Glu Leu Leu Thr Gin Val Pro Ser Ser 
405 410 415 

He Leu Asp. Leu Val Asp Asp His His Gly Ser Thr Gly Gly Gin Thr 
420 425 430 

Val. Arg Cys Thr Ala Glu Gly Thr Pro Leu Pro Asp He Glu Trp Met 
435 440 445 

He cys Lys Asp He Lys Lys Cys Asn Asn Glu Thr Ser Trp Thr He 
450 455 460 

Leu Ala Asn Asn Val Ser Asn lie He Thr Glu He His Ser Arg Asp 
465 470 475 480 

Arg Ser Thr Val Glu Gly Arg Val Thr Phe Ala Lys Val Glu Glu Thr 
485 490 495 

He Ala Val Arg Cys Leu Ala Lys Asn Leu Leu Gly Ala Glu Asn Arg 
500 505 510 

Glu Leu Lys Leu Val Ala Pro Thr Leu Arg Ser Glu Leu Thr Val Ala 
515 520 " 525 

Ala Ala Val Leu Val Leu Leu Val He Val He He Ser Leu He Val 
530 535 540 

Leu Val Vol He Trp Lys Gin Lys Pro Arg Tyr Glu He Arg Trp Arg 
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Prv Met Gin Leu Pro Tyr Asp Ser Arg Trp Glu Phe Pro Arg Asp Gly 
5S0 585 590 

Leu Val Leu Gly Arg Val Leu Gly Ser Gly Ala Phe Gly Lys Val Val 
595 600 605 

Glu Gly Thr Ala Tyr Gly Leu Ser Arg Ser Gin Pro Val Met Lys Val 
610 615 " 620 

Ala Val Lys Met Leu Lys Pro Thr Ala Arg Ser Ser Glu Lys Gin Ala 
625 630 635 640 

Leu Met Ser Glu Leu Lys He Met Thr His Leu Gly Pro His Leu Asn 
645 650 655 

He Vai Asn. Leu Leu Gly Ala cys Thr Lys Ser Gly Pro lie Tyr lie 
660 665 670 

He Thr Glu Tyr Cys Phe Tyr Gly Asp Leu Val Asn Tyr Leu His Lys 
675 680 685 

Asn Arg Asp Ser Phe Leu Ser His His Pro Glu Lys Pro Lys Lys Glu 
690 695 700 

Leu Asp lie Phe Gly Leu Asn Pro Ala Asp Glu Ser Thr Arg Ser Tyr 
705 710 715 720 

Val lie Leu Ser Phe Glu Asn- Asn Gly Asp Tyr Met Asp Met Lys Gin 
725 730 735 

Ala Asp Thr Thr Gin Tyr Val Pro Met Leu Glu Arg Lys Glu Val Ser 
740 745 750 

Lys Tyr Ser Asp lie Gin Arg Ser Leu Tyr Asp Arg Pro Ala Ser Tyr 
755 760 765 

Lys Lys Lys Ser Met Leu Asp Ser Glu Val Lys Asn Leu Leu Ser Asp 
770 775 780 

Asp Asn Ser Glu Gly Leu Thr Leu Leu Asp Leu Leu Ser Phe Thr Tyr 
785 790 795 800 

Gin Val Ala Arg Gly Met Glu Phe Leu Ala Ser Lys Asn Cys Val His 
805 810 815 

Arg Asp Leu Ala Ala Arg Asn Val Leu Leu Ala Gin Gly Lys He Val 
820 825 830 

Lys lie cys Asp Phe Gly Leu Ala Arg Asp He Met His Asp Ser Asn 
835 840 845 

Tyr Val Ser Lys Gly Ser Thr Phe Leu Pro Val Lys Trp Met Ala Pro 
850 855 860 

Glu Ser He Phe Asp Asn Leu Tyr Thr Thr Leu Ser Asp Val Trp Ser 
865 870 875 880 

Tyr Gly He Leu Leu Trp- Glu lie Phe Ser Leu Gly Gly Thr Pro Tyr 
885 890 * 895 

Pro Gly Met Met Val Asp Ser Thr Phe Tyr Asn Lys He Lys Ser Gly 
900 905 910 

Tyr Arg Met Ala Lys Pro Asp His Ala Thr Ser Glu Val Tyr Glu He 
915 920 925 

Met Val Lys cys Trp Asn Ser Glu Pro Glu Lys Arg Pro Ser Phe Tyr 

930 935 940 
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ser Tyr Glu Lys lie His Leu Asp Phe Leu Lys Ser Asp His Pro Ala 
val Ala Arg Met Arg Val Asp Ser Asp Asn Ala Tyr lie Gly Val Thr 

980 9B5 990 

Tvr Lys Asn Glu Glu Asp Lys Leu Lys Asp Trp Glu Gly Gly Leu Asp 
1 1 995 1000 1005 

Glu Gin Arg Leu Ser Ala Asp Ser Gly Tyr He lie Pro Leu Pro Asp 
1010 1015 1020 

He Asp Pro Val Pro Glu Glu Glu Asp Leu Gly Lys Arg Asn Arg His 
1025 1030 1035 1° 40 

Ser Ser Gin Thr Ser Glu Glu Ser Ala He Glu Thr Gly Ser Ser Ser 
1045 1050 1055 

Ser Thr Phe He Lys Arg Glu Asp Glu Thr He Glu Asp Asp Hefc 
1060 1065 1070 

Met Asp Asp He Gly He Asp Ser Ser Asp I*u Val Glu Asp Ser Phe 
1075 1080 1085 

Leu 

(2)' INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6375 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(ili) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 129. .3395 w _ ... 
(D) OTHER INFORMATION: /note- "nucleotxde number 1 or mis 

sequence is identical to the nucleotide number 1 
of the previous 4100 long sequence" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TTGGAGCTAC AGGGAGAGAA ACAGAGGAGG AGACTGCAAG AGATCATTGG AGGCCGTGGG 60 

CACGCTCTTr ACTCCATGTG TGGGACATTC ATTGCGGAAT AACATCGGAG GAGAAGTTTC 120 

CCAGAGCTAT GGGGACTTCC CATCCGGCGT TCCTGGTCTT AGGCTGTCTT CTCACAGGGC 180 

TGAGCCTAAT CCTCtGCCAG CTTTCATTAC CCTCTATCCT TCCAAATGAA AATGAAAAGG 240 

TTGTGCAGCT GAATTCATCC TTTTCTCTGA GATGCTTTGG GGAGAGTGAA GTGAGCTGGC 300 

AGTACCCCAT GTCTGAAGAA GAGAGCTCCG ATGTGGAAAT CAGAAATGAA GAAAACAACA 360 

GCGGC CTT TT TGTGACGGTC TTGGAAGTGA GCAGTGCCTC GGCGGCCCAC ACAGGGTTGT 420 
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TCATCGTGGA GGATGATGAT TCTGCCATTA TACCTTGTCG CACAACTGAT CCCGAGACTC 600 

CTGTAACCTT ACACAACAGT GAGCGGCTGG TACCTGCCTC CTACGACAGC AGACAGGGCT 660 

TTAATGGGAC CTTCACTGTA GGGCCCTATA TCTGTGAGGC CACCGTCAAA GGAAAGAAGT 720 

TCCAGACCAT CCCATTTAAT GTTTATGCTT TAAAAGCAAC ATCAGAGCTG GATCTAGAAA 780 

TGGAAGCTCT TAAAACC6TG TATAAGTCAG GGGAAAC6AT TGTGGTCACC TCTGCTGTTT 840 

TTAACAATGA GGTGGTTGAC CTTCAATGGA CTTACCCTGG AGAAGTGAAA GGCAAAGGCA 900 

TCACAATGCT GGAACAAATC AAAGTCCCAT CCATCAAATT GGTGTACACT TTGACGGTCC 960 

CCGAGGCCAC GGTGAAAGAC AGTGGAGATT ACCAATCTGC TGCCCGCCAG GCTACCAGGG 1020 

AGGTCAAAGA AATGAAGAAA GTCACTATTT CTGTCCATGA GAAAGGTTTC ATTGAAATCA 1080 

AACCCACCTT CAGCCAGTTG GAAGCTGTCA ACCTGCATGA AGTCAAACAT TTTGTTGTAG 1140 

AGGTGCGGGC CTACCCACCT CCCAGGATAT CCTGGCTGAA AAACAATCTG ACTCTGATTG 1200 

AAAATCTCAC TGAGATCACC ACTGATGTGG AAAAGATTCA GGAAATAAGG TATCGAAGCA 1260 

AATTAAAGCT GATCCGTGCT AAGGAACAAG ACAGTGGCCA TTATACTATT GTAGCTCAAA 1320 

ATGAAGATGC TGTGAAGAGC TATACTTTTG AACTGTTAAC TCAAGTTCCT TCATCCATTC 1380 

TGGACTTGGT CGATGATCAC "CATGGCTCAA CTGGGGGACA GACGGTGAGG TGCACAGCTG 1440 

AAGGCACGCC G CTTCCTG AT ATTGAGTGGA TGATATGCAA AGATATTAAG AAATGTAATA 1500 

ATGAAACTTC CTGGACTATT TTGGCCAACA ATGTCTCAAA CATCATCACG GAGATCCACT 1560 

CCCGAGACAG GAGTACCGTG GAGGGCCGTG TGACTTTCGC CAAAGTGGAG GAGACCATCG 1620 

CCGTGCGATG CCTGGCTAAG AATCTCCTTG GAGCTGAGAA CCGAGAGCTG AAGCTGGTGG 1680 

CTCCCACCCT GCGTTCTGAA CTCACGGTGG CTGCTGCAGT CCTGGTGCTG TTGGTGATTG 1740 

TGATCATCTC ACTTATTGTC CTGGTTGTCA TTTGGAAACA GAAACCGAGG TATGAAATTC 18 00 

GCTGGAGGGT CATTGAATCA ATCAGCCCAG ATGGACATGA ATATATTTAT GTGGACCCGA 1860 

TGCAGCTGCC TTATGACTCA AGATGGGAGT TTCCAAGAGA TGGACTAGTG CTTGGTCGGG 1920 

TCTTGGGGTC TGGAGCGTTT GGGAAGGTGG TTGAAGGAAC AGCCTATGGA TTAAGCCGGT 1980 

CCCAACCTGt CATGAAAGTT GCAGTGAAGA TGCTAAAACC CACGGCCAGA TCCAGTGAAA 2040 

AACAAGCTCT CATGTCTGAA CTGAAGATAA TGACTCACCT GGGGCCACAT TTGAACATTG 2100 

TAAACTTGCT GGGAGCCTGC ACCAAGTCAG GCCCCATTTA CATCATCACA GAGTATTGCT 2160 

TCTATGGAGA TTTGGTCAAC TATTTGCATA AGAATAGGGA TAGCTTCCTG AGCCACCACC 2220 
CAGAGAAGCC AAAGAAAGAG CTGGATATCT TTGGATTGAA CCCTGCTGAT GAAAGCACAC 2280 
GGAGCTATGT TATTTTATCT TTTGAAAACA ATGGTGACTA CATGGACATG AAGCAGGCTG 2340 
ATACTACACA GTATGTCCCC ATGCTAGAAA GGAAAGAGGT TTCTAAATAT TCCGACATCC 2400 
AGAGATCACT CTATGATCGT CCAGCCTCAT ATAAGAAGAA ATCTATGTTA GACTCAGAAG 2460 
TCAAAAACCT CCTTTCAGAT GATAACTCAG AAGGCCTTAC TTTATTGGAT TTGTtGAGCT 2520 
TCACCTATCA AGTTGCCCGA GGAATGGAGT TTTTGGCTTC AAAAAATTGT GTCCACCGTG 2580 
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CCGTGAAGTG GATGGCTCCT GAGAGCATCT TTGACAACCT CTACACCACA CTGAGTGATG 2760 

TCTCGTCTTA TGGCAITCTG CTCTGGGAGA T C T TTTCCCT TGGTGGCACC CCTTACCCCG 2820 

GCATGATGGT GGATTCTACT TTCTACAATA AGATCAAGAG TGGGTACCGG ATGGCCAAGC 2880 

CTGACCACGC TACCAGTGAA GTCTACGAGA TCATGGTGAA ATGCTGGAAC AGTGAGCCGG 2940 

AGAAGAGACC CTCCTTTTAC CACCTGAGTG AGATTGTGGA GAATCTGCTG CCTGGACAAT 3000 

ATAAAAAGAG TTATGAAAAA ATTCACCTGG ACTTCCTGAA GAGTGACCAT CCTGCTGTGG 3060 

CACGCATGCG TGTGGACTCA GACAATGCAT ACATTGGTGT CACCTACAAA AACGAGGAAG 3120 

ACAAGCTGAA GGACTGGGAG GGTGGTCTGG ATGAGCAGAG ACTGAGCGCT GACAGTGGCT 3X80 

ACATCATTCC TCTGCCTGAC ATTGACCCTG TCCCTGAGGA GGAGGACCTG GGCAAGAGGA 3240 

ACAGACACAG CTCGCAGACC TCTGAAGAGA GTGCCATTGA GACGG6TTCC AGCAGTTCCA 3300 

CCTTCATCAA GAGAGAGGAC GAGACCATTG AAGACATCGA CATGAT6GAC GACATCGGCA 3360 

TAGACTCTTC AGACCTGGTG GAAGACAGCT TCCTG TA ACT GGCGGATTCG AGGGGTTCCT 3420 

TCCACrrCTG GGGCCACCTC TGGATCCCGT TCAGAAAACC ACTTTATTGC AATGCGGAGG 3480 

TTGAGAGGAG GA C TT GGT T G ATGTTTAAAG AGAAGTTCCC AGCCAAGGGC CTCGGGGAGC 3540 

CTTTCTAAAT ATGAATGAAT GGGATATTTT GAAATGAACT TTGTCAGTGT TGCCTCTTGC 3600 

AATGCCTCAG TAGCATCTCA GTGGTGTGTG AAGTTTGGAG ATAGATGGAT AAGGGAATAA 3660 

TAGGCCACAG AAGGTGAACT TTCTGCTTCA AGGACATTGG TGACAGTCCA ACAGACACAA 3720 

TTTATACTGC GACAGAACIT CAGCATTGTA ATTATGTAAA TAACTCTAAC CACGGCTGTG 3780 

TTTAGATTGT ATTAACTATC TTCTTTGGAC TTCTGAAGAG ACCACTCAAT CCATCCATGT 3840 

ACTTCCCTCT TGAAACCTGA TGTCAGCTGC TGTTGAACTT TTTAAAGAAG TGCATGAAAA 3900 

ACCATTTTTG ACCTTAAAAG GTACTGGTAC TATAGCATTT TGCTATCTTT TTTAGTGTTA 39 60 

AAGAGATAAA GAATAATAAT TAACCAACCT TGTTTAATAG ATTTGGGTCA TTTAGAAGCC 4020 

TGACAACTCA TTTTCATATT GTAATCTATG TTTATAATAC TACTACTGTT ATCAGTAATG 4080 

CTAAATGTGT AATAATGTAA CATGATTTCC CTCCACACAA AGCACAATTT AAAAACAATC 4140 

CTTACIAAGT AGGTGATGAG TTTGACAGTT TTTGACATTT ATATTAAATA ACATGTTTCT 4200 

CTATAAAGTA TGGTAATAGC TTTAGTGAAT TAAATTTAG7 TGAGCATAGA GAACAAAGTA 4260 

AAAGTAGTGT TGTCCAGGAA GTCAGAATTT TTAACTGTAC TGAATAGGTT CCCCAATCCA 4320 

TCGTATTAAA AAACAATTAA CTGCCCTCTG AAATAATGGG ATTAGAAACA AACAAAACTC 4380 

TTAAGTCCTA AAAGTTCTCA ATGTAGACGC ATAAACCTGT GCTGAACATA ACTTCTCATG 4440 

TATATTACCC AATGGAAAAT ATAATGATCA GCGCANAAAG ACTGGATTTG CAGAAGTTKT 4500 

TTTTTTTTTT TCTTCTTGCC TGATGAAAGC TTTGGCGACC CCAATATATG TATTTTTTGA 4560 

ATCTATGAAC CTGAAAAGGG TCACAAAGGA TGCCCAGACA TCAGCCTCCT TCTTTCACCC 4620 

CTTACCCCAA AGAGAAAGAG TTTGAAACTC GAGACCATAA AGATATTCTT TAGTGGAGGC 4680 

TGGAAGTGCA TTAGCCTGAT CCXCAGTTCT CAAATGTGTG . TGGCAGCCAG GTAGACTAGT 4740 

ACCTGGGTTT CCATCCTTGA GATTCTGAAG TATGAAGTCT GAGGGAAACC AGAGTCTGTA 4800 
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CAGGAAGTTG CCATGGGAAA CAAATAATTT GAACTTTGGA ACAGGGTTCT TAAGTTGGTG 4 920 

CCTCCTTCGG ATGATAAATT TAGGAACCGA AGTCCAATCA CTGTAAATTA CGGTAGATCG 4980 

ATCGTTAACC CTGGAATTAA ATTGAAAGGT CAGAATCGAC TCCGACTCTT TCGATTTCAA 5040 

ACCAAAACTG TCCAAAAGGT TTTCATTTCT ACGATGAAGG GTGACATACC CCCTCTAACT 5100- 

TGAAAGGGGC AGAGGGCAGA AGAGCGGAGG GTGAGGTATG GGGCGGTTCC TTTCCGTACA 5160 

TGTTTTTAAT ACGTTAAGTC ACAAGGTTCA GAGACACATT GGTCGAGTCA CAAAACCACC 5220 

TTTTTTGTAA AATTCAAAAT GACTATTAAA CTCCAATCTA CCCTCCTACT TAACAGTGTA 5280 

GATAGGTCTG ACAGTTTGTC CAACCACACC CAAGTAACCG TAAGAAACGT TATGACGAAT 5340 

TAACGACTAT GGTATACTTA CTTTGTACCC GACACTAATG ACGTTAGTGA CACGATACCC 5400 

GTCTACTACG AAACCTTCTA CGTCTTCGTT ATTATTTCAT GAACTGATGG ATGACCACAT 5460 

TAGAGTTACG TTCGCGGTTG AAAGAATAGG TTGAAAAAGT ATCATTCACG CTTCTGACTC 5520 

GGTCTAACCG GTTAATTTTT CTTXTGGACT GATCCAAGAC ATCTCGGTTA ATCTGAACTT 5580 

TATGCAAACA CAAAGATCTT AGTGTCGAGT TCGTAAGACA AATAGCGAGT GAGAGGGAAC 5640 

ATGTCGGAAT AAAACAACCA CGAAACGTAA AACTATAACG ACACTCGGAA CGTACTGTAG 5700 

TACTCCGGCC TACTTTGAAG AGTCAGGTCG TCAAAGGTCA GGATTGTTTA CGAGGGTGGA 5760 

CTTAAACATA TACTCACGTA AACACCCACA CACACACAAA AGTCGTTTAA GGTCTAAACA 5820 

AAGCAAAACC GGAGGACGTT TCAGAGCTCT TCTTTTAAAC GGTTAGAAAG GATGAAAGAT 5880 

AAAAATACTA CTGTTAGTTT CGGCCGGACT CTTTGTGATA AACACTGAAA AATTTGCTAA 5940 

TCACTACAGG AATTTTACAC CAGACGGTTA GACATGTTTT ACCAGGATAA AAACACTTCT 6000 

CCCTGTATTC TACTTTACTA CAATATGTAG TTATACATAT ATACATAAAG ATATATCTGA 6060 

ACCTCTTATG ACGGTTTTGT AAATACTGTT CGACATAGTG ACGGAAGCAA ATATAAAAAA 6120 

ATTGACACTA TTAGGGGTGT CCGTGTAATT GACAACGTGA AAACTTACAG GTTTTAAATA 6180 

TAAAATCTTT ATTATTTTTC TTTCTATGAA TGTACAAGGG TTTTGTTACC ACACCACTTA 6240 

CACACTCTXT TTGATTGAAC TATCCCAGAT GGTTATGTTT TACATAATGC TTACGGGGAC 6300 

AAGTACAAAA ACAAAATTTT GCACATTTAC TTCTAGAAAT ATAAAGTTAT TTACTATATA 6360 

TTAAATTTCC TTAAG 6375 
(2) INFORMATION FOR SEQ ID NO; 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDHA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 
TCCTTCGACC TACAGATCAA TTAGCTTCCT GTAGGGGGCT G 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: cOflA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATCACCGTGG TTGAGAGCGG CTAGCTTCCT GTAGGGGGCT G 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TACAGACTCC AGGTGTCATC CTAGCTTCCT GTAGGGGGCT G 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 base pairs 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL; NO 
(iv) ANTI-SENSE: NO 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 10: 
CTCTACATCT TTGTGCCAGA TCCCTAGCTT CCTGTAGGGG GCTG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: . 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAGATCTCTC AGGGCCTGGT CACCGTGGGC TTCCTCCCTA ATCAT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

. (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hono Sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAGATCTCTC AGGGCCTGGT CATCAACGTC TCTGTGAACG CAGTGCAG 
(2) INFORMATION FOR SEQ ID NO: 13; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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ORIGINAL SOURCE: 
(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CACATCTCTC AGGGCCTGGT CTACGTGCGG CTCCTGGGAG AGCTG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CACATCTCTC AGGGCCTGGT CGTCCGAGTG CTGGAGCTAA GT 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda grio 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCTCCCACCC TGCGTTCTGA ATAACTGGCG GATTCGAGGG G 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
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GAACTGTTAA CTCAACTTCC TTAACTGGCG GATTCCAGGG G 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 bass pairs 

(B) TYPE: nucleic acxd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
/ . 
(iii) HYPOTHETICAL: YES 

(IV) ANTI-SENSE: HO 

(vi) ORIGINAL SOURCES 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtio 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATTTCTGTCC ATGAGAAAGG TTAACTGGCG CATTCGAGG£ G 
(2) INFORKATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hobo Sapiens 

(B) STRAIN: lambda gtio 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X8: 
TATGCTTTAA AAGCAACATC ATAACTGGCG GATTCGAGGG G 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 baa* pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: KO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 



(Xi) SEQUENCE DESCRIPTION I SEQ ID NO: 20: 

AGCCTAATCC TCTGCCAGCT TGATGTAGCC TTTGTACCTC TAGGA 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 48 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

- • 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 



(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGCCTAATCC TCTGCCAGCT TGAGCTGGAT CTAGAAATGG AAGCTCTT 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 
AGCCTAATCC TCTGCCAGCT TTTCATTGAA ATCAAACCCA CCTTC 
(2) INFORMATION FOR SEQ ID NO: 23: 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cDNA to mBNA 
(iii) HYPOTHETICAL: YES 
(iV) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGCCTAATCC TCTGCCACCT TTCATCCATT CTGGACTTGG. TC 
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1. a platelet-derived growth factor receptor 
(hPDGF-R) fragment of between about 8 and 400 amino acids 
comprising one or more platelet-derived growth factor (PDGF) 
ligand binding, regions (LBR's) from extracellular domains Dl,. 
D2, or D3, wherein said fragment binds a platelet-derived 

growth factor ligand. 

2. A PDGF-R fragment of Claim 1, wherein said 

fragment exhibits an affinity of about 5 nM. 

3. a PDGF-R fragment of Claim 1, wherein said 
fragment comprises at least about 15 contiguous amino acids 
from a domain D3 intra-cysteine region. 

4. a PDGF-R fragment of Claim 1, wherein said 
fragment lacks a transmembrane region. 

5. a PDGF-R fragment of Claim 1, wherein said 

fragment is soluble. 

6. A PDGF-R fragment of Claim l f wherein at least 

one of said LBR's is a domain D3 LBR. 

7. A PDGF-R fragment of Claim 1, wherein at least 
one of said LBR's is from a type B or type A PDGF-R LBR. 

3. A PDGF-R fragment of Claim 1, wherein said 

fragment is a contiguous sequence within Table 1 or Table 2. 

9 # a PDGF-R fragment of Claim 1, wherein said 

fragment is selected from the group of formulae consisting of: 

a) Xa-Dm-Xc; 

b) Xa-Dm-Xl-Dn-Xc; 

c) Xa-Dm-Xl~Dn-X2-Dp-Xc; and 

d) Xa-Dm-Xl-Dn-X2 -Dp-X3 -Dq-Xc ; 

. wherein: 

each of Xa, Xl r X2, X3 f and Xc is, if present, 
a polypeptide segment lacking a D domain; and 

each of Dm, Dn, Dp, and Dq is, independently 
of one another, selected from the group consisting of Dl, D2, 

D3, D4, and D5. 

10. A PDGF-R fragment of Claim 1, wherein said 
fragment is selected from the group consisting of: 

a) D1-D2-D3; and 

b) D1-D2-D3-D4. 
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11. A soluble human platelet -derived growth factor 
receptor (hPDGF-R) fragment of less than about 400 amino acids 
comprising at least one platelet-derived growth factor (PDGF) 
ligand binding region (LBR) from domain D3, wherein said 

5 fragment specifically binds to a platelet-derived growth factor 
ligand. 

12. A hPDGF-R fragment of Claim 11, wherein said 
fragment comprises a sequence of at least about 15 contiguous 
amino acids from the intra-cysteine portion of domain D3. 

10 13. A hPDGF-R fragment of Claim 11, wherein said 

fragment is substantially pure. 

14. A hPDGF-R fragment of Claim 11, wherein said 

LBR is derived from a type B or type A PDGF-R, and further is a 

sequence in Table 1 or Table 2. 
15 15. A nucleic acid sequence encoding a PDGF-R 

fragment of Claim 1. 

16. A nucleic acid sequence encoding a hPDGF-R 
fragment of claim 11. 

17. A nucleic acid of Claim 15 wherein said 
20 encoding sequence is operably linked to a promoter. 

18. A cell comprising a PDGF-R fragment of 

Claim 1. 

19. A cell comprising a hPDGF-R fragment of 

Claim 11. 

25 20. A mammalian cell comprising a nucleic acid of 

Claim 15. 

21. A mammalian cell comprising a nucleic acid of 

Claim 16. 

22. A cell comprising both a nucleic acid of Claim 
30 15, and a protein expression product of said nucleic acid. 

23. An antibody which recognizes an epitope of a 
PDGF-R fragment of Claim 1, wherein said epitope is not found 
on a natural PDGF-R. 

24. An antibody of Claim 23, wherein said antibody 
35 is a monoclonal antibody. 

25. A method for measuring the PDGF ligand binding 
activity of a biological sample comprising the steps of: 
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a) contacting an aliquot of said sample to a 
PDGF ligand in the presence of a PDGF-R fragment of Claim 1 in 
a first analysis; 

b) contacting an aliquot of said sample to a 
5 PDGF ligand in the absence of said PDGF-R fragment in a second 

analysis; and 

c) comparing the amount of said PDGF ligand 
binding in the two analyses. 

26. A method of Claim 25 , wherein said PDGF-R 

10 fragment is attached to a cell. 

27. A method of Claim 26, wherein said PDGF-R 
fragment is attached to a solid substrate. 

28. A method of Claim 27, wherein said solid 
substrate is a microtiter dish. 

15 29. A method for measuring the PDGF ligand content 

of a biological sample comprising the steps of: 

a) contacting an aliquot of said sample to a 

ligand binding region (LBR) in the presence of a PDGF-R 

fragment of Claim 1 in a first analysis; 
20 b) contacting an aliquot of said sample to a 

LBR in the absence of said PDGF-R fragment in a second 

analysis; and 

c) comparing the amount of binding in the two 

analyses. 

2 5 30. A method of Claim 29 , wherein said contacting 

steps are performed simultaneously. 




ei IDGTITI ITCT RMPPT 



WO 92/13867 2 / 3 PCT/US92/tiO730 




3/3 



WO 92/13867 



PCT/US92/O0730 



INHIBITION OF RECEPTOR 
PHOSPHORYLATION BY BETA R 



r 



mm 



0.33 nM 0.33 nM 

PDGF BB PDGF AA 

7 / 7 



if 



* * J y v * % 





-p185 



. ::..v 




/76 < 



INTERNATIONAL SEARCH REPORT 

international Application No. PCT/US92/OO730 



t. CLASSIFICATION OF SUBJECT MATTER t»t several classification symools apply, indicate all) 3 
According to international Patant Dassificatton tIPC) or to Doth Naaonai Classification and IPC 
IPC (S): Pleeae Sea Attached SheeC 

US CL : 43S/2, 4, 6, 240 . 2 ; 424/85 . 3 ; 536/2?; 530/350 



H. FIELDS SEABCHgD 



Minimum Documentation Searched 4 



Svsterr 



Classification Symbols 



U.S. 



435/2. 4, 6, 240. 2; 424/85. 8; 536/27; 530/35(3 



Documentation Searcned other than Minimum Documentation 
to the extent that sucn Documents are included tn the Fields Searched 6 



APS AND 



III. DOCUMENTS CONSIDERED TO BE RELEVANT 1 



Category* 



Ctaoon of Document' 6 with indication, where appropriate, of the relevant passages' 1 



Relevant to Clam No'. ' ■ 



Cell, Volume 45, issued July :s, 1996, Ross et al., 
•The Biology of Platelet -Derived Growch Factor*, pages 
155-169, see che encire document. 



Proc. Natl. Acad. Sci. USA, Volume 86, -issued July 
1989, Claesson-Welsh et al., "cDNA cloning and 
expression of the human A- type platelet -derived growth 
~ receptor establishes structural similarity to 
receptor-, pages 4917-4921, see the 



1-30 



1-30 



rac 

che B-type PDGF 
entire document. 



Nautre, Volume 323', issued 13 September 1986, Yarden et 
al., -Structure of the receptor for platelet-derived 
growth factor helps define a family of closely related 
growth factor receptors", pages. 226-232, see the entire 
document . 



1-30 



Special categones of crted documents : ' • 
"A" document defining the general state of the art which is 

not considered to be of particular relevance 
"V earlier document but published on or after the 

international filing date . 
V document which may throw doubts on pnonty claimls) 
or which is cited to establish tha publication data of 
another citation or other special reason (as speofiedl 
*0* document referring to an oral disclosure, use. exhibition 
or other means 

document published pnor to the international filing date 



*T* later document published after the international filing 
date or pnonty date end not in conflict with the 
application but cited to understand the pnnciple or 
theory underlying the invention 

"X" document of particular relevance: the claimed 
invention cannot be considered novel or cannot be 
considered to involve an inventive step 

"Y" document of particular relevance; the claimed 
invention cannot be considered to involve an 
inventive step when the document is combined with 
one or more other such documents, such combination 
being obvious to a person skilled in the an 



------ ai aocumeni memoer or rne same parent ramny^ 

IV. CERTIFICATION 


Date of the Actual Completion of the International Search* 

23 APRIL 1992 


Oate of Mailing of this International Search Report 2 

1 2 MAY 1992 


International Searching Authority : 

ISA/ITS ^ 


Signature ^AuT^na^d^Ofhc^ 2 ^ ^ j 
( tTAN WANG | 



Form PCT/lSA/210 Isecond sheeDIMay 1986) 0 





International Application Ho, pct/us9z/oo730 



FVrtTHER INFORMATION CONTINUED FROM PREVIOUS SHEETS 



I. CLASSIFICATION OF SUBJECT MATTES : 
IPC ;S> : 



C ZfR IS/12. iVQO; C 07 K 3/00, U/:5. 15/CO, 17/30; A 01 M 1/32; C 12 Q 1/03, 
1,63; C12 N S/00 



71. :3SSRVATI3N3 WHERE CNI7Y Cc Ur.'SXTlCiT WAS LACKING 
This ISA found multiple inventions as follows : 

I . Claims 1-22 and 25-23 are, drawn :c a method for measuring the PDGF-0 ligand cy 
using a placelec -derived growth factor receptor fragment and its DNA sequence. 
:i. Claims 23-24 are, drawn to an antibody. 

ZZZ. Claims 29-33 are, drawn to a method ftr measuring the PDGF ligand content of a 



oiological sample. 



Form PCT-1SA.210 (continuation sheet OH Oct 1391)1 a 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 



J£ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

^□"COLOR OR BLACK AND WHITE PHOTOGRAPHS 
/□"GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 




BLACK BORDERS 



